commit 388f6d150784d9d1d25a3b6bace00aa52a85daf3
Author: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Date:   Wed Aug 30 14:52:45 2023 +0200

    Linux 6.4.13
    
    Link: https://lore.kernel.org/r/20230828101157.383363777@linuxfoundation.org
    Tested-by: Ronald Warsow <rwarsow@gmx.de>
    Tested-by: Justin M. Forbes <jforbes@fedoraproject.org>
    Tested-by: Linux Kernel Functional Testing <lkft@linaro.org>
    Tested-by: Salvatore Bonaccorso <carnil@debian.org>
    Tested-by: Ron Economos <re@w6rz.net>
    Tested-by: Joel Fernandes (Google) <joel@joelfernandes.org>
    Tested-by: SeongJae Park <sj@kernel.org>
    Tested-by: Conor Dooley <conor.dooley@microchip.com>
    Tested-by: Bagas Sanjaya <bagasdotme@gmail.com>
    Tested-by: Sudip Mukherjee <sudip.mukherjee@codethink.co.uk>
    Tested-by: Shuah Khan <skhan@linuxfoundation.org>
    Tested-by: Florian Fainelli <florian.fainelli@broadcom.com>
    Tested-by: Guenter Roeck <linux@roeck-us.net>
    Tested-by: Jon Hunter <jonathanh@nvidia.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 734cf5795f4ba807c7418d0eaea2a9e5dda0cc48
Author: Florian Westphal <fw@strlen.de>
Date:   Thu Aug 10 23:59:03 2023 +0200

    netfilter: nf_tables: fix kdoc warnings after gc rework
    
    commit 08713cb006b6f07434f276c5ee214fb20c7fd965 upstream.
    
    Jakub Kicinski says:
      We've got some new kdoc warnings here:
      net/netfilter/nft_set_pipapo.c:1557: warning: Function parameter or member '_set' not described in 'pipapo_gc'
      net/netfilter/nft_set_pipapo.c:1557: warning: Excess function parameter 'set' description in 'pipapo_gc'
      include/net/netfilter/nf_tables.h:577: warning: Function parameter or member 'dead' not described in 'nft_set'
    
    Fixes: 5f68718b34a5 ("netfilter: nf_tables: GC transaction API to avoid race with control plane")
    Fixes: f6c383b8c31a ("netfilter: nf_tables: adapt set backend to use GC transaction API")
    Reported-by: Jakub Kicinski <kuba@kernel.org>
    Closes: https://lore.kernel.org/netdev/20230810104638.746e46f1@kernel.org/
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit ae2d1461ebccba1596b08f8478bf5c9bed854184
Author: Günther Noack <gnoack3000@gmail.com>
Date:   Tue Aug 8 22:11:12 2023 +0200

    TIOCSTI: Document CAP_SYS_ADMIN behaviour in Kconfig
    
    commit 3f29d9ee323ae5cda59d144d1f8b0b10ea065be0 upstream.
    
    Clarifies that the LEGACY_TIOCSTI setting is safe to turn off even
    when running BRLTTY, as it was introduced in commit 690c8b804ad2
    ("TIOCSTI: always enable for CAP_SYS_ADMIN").
    
    Signed-off-by: Günther Noack <gnoack3000@gmail.com>
    Reviewed-by: Samuel Thibault <samuel.thibault@ens-lyon.org>
    Link: https://lore.kernel.org/r/20230808201115.23993-1-gnoack3000@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 259ff81cee3c8e8b2b06be7cc31ae401247f3101
Author: Arnd Bergmann <arnd@arndb.de>
Date:   Mon Jun 5 10:58:29 2023 +0200

    ASoC: amd: vangogh: select CONFIG_SND_AMD_ACP_CONFIG
    
    commit fd0a7ec379dbf21b7bfd81914381ae5281706ef5 upstream.
    
    The vangogh driver just gained a link time dependency that now causes
    randconfig builds to fail:
    
    x86_64-linux-ld: sound/soc/amd/vangogh/pci-acp5x.o: in function `snd_acp5x_probe':
    pci-acp5x.c:(.text+0xbb): undefined reference to `snd_amd_acp_find_config'
    
    Fixes: e89f45edb747e ("ASoC: amd: vangogh: Add check for acp config flags in vangogh platform")
    Signed-off-by: Arnd Bergmann <arnd@arndb.de>
    Link: https://lore.kernel.org/r/20230605085839.2157268-1-arnd@kernel.org
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit d641fa9fc8fc2c4890eeccaed61ca54f959f0844
Author: Liam R. Howlett <Liam.Howlett@oracle.com>
Date:   Fri Aug 18 20:43:55 2023 -0400

    maple_tree: disable mas_wr_append() when other readers are possible
    
    [ Upstream commit cfeb6ae8bcb96ccf674724f223661bbcef7b0d0b ]
    
    The current implementation of append may cause duplicate data and/or
    incorrect ranges to be returned to a reader during an update.  Although
    this has not been reported or seen, disable the append write operation
    while the tree is in rcu mode out of an abundance of caution.
    
    During the analysis of the mas_next_slot() the following was
    artificially created by separating the writer and reader code:
    
    Writer:                                 reader:
    mas_wr_append
        set end pivot
        updates end metata
        Detects write to last slot
        last slot write is to start of slot
        store current contents in slot
        overwrite old end pivot
                                            mas_next_slot():
                                                    read end metadata
                                                    read old end pivot
                                                    return with incorrect range
        store new value
    
    Alternatively:
    
    Writer:                                 reader:
    mas_wr_append
        set end pivot
        updates end metata
        Detects write to last slot
        last lost write to end of slot
        store value
                                            mas_next_slot():
                                                    read end metadata
                                                    read old end pivot
                                                    read new end pivot
                                                    return with incorrect range
        set old end pivot
    
    There may be other accesses that are not safe since we are now updating
    both metadata and pointers, so disabling append if there could be rcu
    readers is the safest action.
    
    Link: https://lkml.kernel.org/r/20230819004356.1454718-2-Liam.Howlett@oracle.com
    Fixes: 54a611b60590 ("Maple Tree: add new data structure")
    Signed-off-by: Liam R. Howlett <Liam.Howlett@oracle.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 163d62238efc4d73d1fff0200a8b3822e16ef90a
Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Wed Aug 23 20:11:49 2023 -0500

    ASoC: amd: yc: Fix a non-functional mic on Lenovo 82SJ
    
    [ Upstream commit c008323fe361bd62a43d9fb29737dacd5c067fb7 ]
    
    Lenovo 82SJ doesn't have DMIC connected like 82V2 does.  Narrow
    the match down to only cover 82V2.
    
    Reported-by: prosenfeld@Yuhsbstudents.org
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=217063
    Fixes: 2232b2dd8cd4 ("ASoC: amd: yc: Add Lenovo Yoga Slim 7 Pro X to quirks table")
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com
    Link: https://lore.kernel.org/r/20230824011149.1395-1-mario.limonciello@amd.com
    Signed-off-by: Mark Brown <broonie@kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 45bb78bc2f57d548b685404969d2c999fb7d8598
Author: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
Date:   Tue Aug 22 21:29:43 2023 +0200

    gpio: sim: pass the GPIO device's software node to irq domain
    
    [ Upstream commit 6e39c1ac688161b4db3617aabbca589b395242bc ]
    
    Associate the swnode of the GPIO device's (which is the interrupt
    controller here) with the irq domain. Otherwise the interrupt-controller
    device attribute is a no-op.
    
    Fixes: cb8c474e79be ("gpio: sim: new testing module")
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 14540aa3eaba8e3977249f9fc13cc665db12269f
Author: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
Date:   Tue Aug 22 21:29:42 2023 +0200

    gpio: sim: dispose of irq mappings before destroying the irq_sim domain
    
    [ Upstream commit ab4109f91b328ff5cb5e1279f64d443241add2d1 ]
    
    If a GPIO simulator device is unbound with interrupts still requested,
    we will hit a use-after-free issue in __irq_domain_deactivate_irq(). The
    owner of the irq domain must dispose of all mappings before destroying
    the domain object.
    
    Fixes: cb8c474e79be ("gpio: sim: new testing module")
    Signed-off-by: Bartosz Golaszewski <bartosz.golaszewski@linaro.org>
    Reviewed-by: Andy Shevchenko <andriy.shevchenko@linux.intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit c6e1fcd50cbc7510c9ec220137aa4ccab4eb3ed8
Author: Rob Clark <robdclark@chromium.org>
Date:   Fri Aug 18 07:59:38 2023 -0700

    dma-buf/sw_sync: Avoid recursive lock during fence signal
    
    [ Upstream commit e531fdb5cd5ee2564b7fe10c8a9219e2b2fac61e ]
    
    If a signal callback releases the sw_sync fence, that will trigger a
    deadlock as the timeline_fence_release recurses onto the fence->lock
    (used both for signaling and the the timeline tree).
    
    To avoid that, temporarily hold an extra reference to the signalled
    fences until after we drop the lock.
    
    (This is an alternative implementation of https://patchwork.kernel.org/patch/11664717/
    which avoids some potential UAF issues with the original patch.)
    
    v2: Remove now obsolete comment, use list_move_tail() and
        list_del_init()
    
    Reported-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl>
    Fixes: d3c6dd1fb30d ("dma-buf/sw_sync: Synchronize signal vs syncpt free")
    Signed-off-by: Rob Clark <robdclark@chromium.org>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230818145939.39697-1-robdclark@gmail.com
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Christian König <christian.koenig@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 8c776cd8f1db931faa2fb16bd108c68e5c1b31b8
Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Tue Aug 15 14:15:58 2023 +0100

    pinctrl: renesas: rza2: Add lock around pinctrl_generic{{add,remove}_group,{add,remove}_function}
    
    [ Upstream commit 8fcc1c40b747069644db6102c1d84c942c9d4d86 ]
    
    The pinctrl group and function creation/remove calls expect
    caller to take care of locking. Add lock around these functions.
    
    Fixes: b59d0e782706 ("pinctrl: Add RZ/A2 pin and gpio controller")
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/r/20230815131558.33787-4-biju.das.jz@bp.renesas.com
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 3efa0b7fc28d57a01aba0fb01e8c27dea4e2cb72
Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Tue Aug 15 14:15:57 2023 +0100

    pinctrl: renesas: rzv2m: Fix NULL pointer dereference in rzv2m_dt_subnode_to_map()
    
    [ Upstream commit f982b9d57e7f834138fc908804fe66f646f2b108 ]
    
    Fix the below random NULL pointer crash during boot by serializing
    pinctrl group and function creation/remove calls in
    rzv2m_dt_subnode_to_map() with mutex lock.
    
    Crash logs:
        pc : __pi_strcmp+0x20/0x140
        lr : pinmux_func_name_to_selector+0x68/0xa4
        Call trace:
        __pi_strcmp+0x20/0x140
        pinmux_generic_add_function+0x34/0xcc
        rzv2m_dt_subnode_to_map+0x2e4/0x418
        rzv2m_dt_node_to_map+0x15c/0x18c
        pinctrl_dt_to_map+0x218/0x37c
        create_pinctrl+0x70/0x3d8
    
    While at it, add a comment for lock.
    
    Fixes: 92a9b8252576 ("pinctrl: renesas: Add RZ/V2M pin and gpio controller driver")
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/r/20230815131558.33787-3-biju.das.jz@bp.renesas.com
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit a1f12138b17d7a77af3880700e3909fa41f52f61
Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Tue Aug 15 14:15:56 2023 +0100

    pinctrl: renesas: rzg2l: Fix NULL pointer dereference in rzg2l_dt_subnode_to_map()
    
    [ Upstream commit 661efa2284bbc2338da0424e219603f034072c74 ]
    
    Fix the below random NULL pointer crash during boot by serializing
    pinctrl group and function creation/remove calls in
    rzg2l_dt_subnode_to_map() with mutex lock.
    
    Crash log:
        pc : __pi_strcmp+0x20/0x140
        lr : pinmux_func_name_to_selector+0x68/0xa4
        Call trace:
        __pi_strcmp+0x20/0x140
        pinmux_generic_add_function+0x34/0xcc
        rzg2l_dt_subnode_to_map+0x314/0x44c
        rzg2l_dt_node_to_map+0x164/0x194
        pinctrl_dt_to_map+0x218/0x37c
        create_pinctrl+0x70/0x3d8
    
    While at it, add comments for bitmap_lock and lock.
    
    Fixes: c4c4637eb57f ("pinctrl: renesas: Add RZ/G2L pin and gpio controller driver")
    Tested-by: Chris Paterson <Chris.Paterson2@renesas.com>
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/r/20230815131558.33787-2-biju.das.jz@bp.renesas.com
    Signed-off-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 66bb9745f96eea7984e62fdb17d3f0e352344778
Author: Maciej Strozek <mstrozek@opensource.cirrus.com>
Date:   Thu Aug 17 12:27:11 2023 +0100

    ASoC: cs35l56: Read firmware uuid from a device property instead of _SUB
    
    [ Upstream commit 897a6b5a030e62c21566551c870d81740f82ca13 ]
    
    Use a device property "cirrus,firmware-uid" to get the unique firmware
    identifier instead of using ACPI _SUB. There aren't any products that use
    _SUB.
    
    There will not usually be a _SUB in Soundwire nodes. The ACPI can use a
    _DSD section for custom properties.
    
    There is also a need to support instantiating this driver using software
    nodes. This is for systems where the CS35L56 is a back-end device and the
    ACPI refers only to the front-end audio device - there will not be any ACPI
    references to CS35L56.
    
    Fixes: e49611252900 ("ASoC: cs35l56: Add driver for Cirrus Logic CS35L56")
    Signed-off-by: Maciej Strozek <mstrozek@opensource.cirrus.com>
    Signed-off-by: Richard Fitzgerald <rf@opensource.cirrus.com>
    Link: https://lore.kernel.org/r/20230817112712.16637-2-rf@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 5e9db7d4d3bca6c42a725bf95a55e35e72df773a
Author: Chao Song <chao.song@linux.intel.com>
Date:   Wed Aug 16 16:33:11 2023 +0300

    ASoC: SOF: ipc4-pcm: fix possible null pointer deference
    
    [ Upstream commit 2d218b45848b92b03b220bf4d9bef29f058f866f ]
    
    The call to snd_sof_find_spcm_dai() could return NULL,
    add nullable check for the return value to avoid null
    pointer defenrece.
    
    Fixes: 7cb19007baba ("ASoC: SOF: ipc4-pcm: add hw_params")
    Signed-off-by: Chao Song <chao.song@linux.intel.com>
    Reviewed-by: Bard Liao <yung-chuan.liao@linux.intel.com>
    Reviewed-by: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com>
    Signed-off-by: Peter Ujfalusi <peter.ujfalusi@linux.intel.com>
    Link: https://lore.kernel.org/r/20230816133311.7523-1-peter.ujfalusi@linux.intel.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit dd07e9de2d829498eb2cac5ab0c4174c3b57706d
Author: Biju Das <biju.das.jz@bp.renesas.com>
Date:   Tue Jul 25 18:51:40 2023 +0100

    clk: Fix undefined reference to `clk_rate_exclusive_{get,put}'
    
    [ Upstream commit 2746f13f6f1df7999001d6595b16f789ecc28ad1 ]
    
    The COMMON_CLK config is not enabled in some of the architectures.
    This causes below build issues:
    
    pwm-rz-mtu3.c:(.text+0x114):
    undefined reference to `clk_rate_exclusive_put'
    pwm-rz-mtu3.c:(.text+0x32c):
    undefined reference to `clk_rate_exclusive_get'
    
    Fix these issues by moving clk_rate_exclusive_{get,put} inside COMMON_CLK
    code block, as clk.c is enabled by COMMON_CLK.
    
    Fixes: 55e9b8b7b806 ("clk: add clk_rate_exclusive api")
    Reported-by: kernel test robot <lkp@intel.com>
    Closes: https://lore.kernel.org/all/202307251752.vLfmmhYm-lkp@intel.com/
    Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com>
    Link: https://lore.kernel.org/r/20230725175140.361479-1-biju.das.jz@bp.renesas.com
    Signed-off-by: Stephen Boyd <sboyd@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 686c9e8221f801c52fd2fa13dff7f7ca1d16456c
Author: Zhu Wang <wangzhu9@huawei.com>
Date:   Tue Aug 22 01:52:54 2023 +0000

    scsi: core: raid_class: Remove raid_component_add()
    
    commit 60c5fd2e8f3c42a5abc565ba9876ead1da5ad2b7 upstream.
    
    The raid_component_add() function was added to the kernel tree via patch
    "[SCSI] embryonic RAID class" (2005). Remove this function since it never
    has had any callers in the Linux kernel. And also raid_component_release()
    is only used in raid_component_add(), so it is also removed.
    
    Signed-off-by: Zhu Wang <wangzhu9@huawei.com>
    Link: https://lore.kernel.org/r/20230822015254.184270-1-wangzhu9@huawei.com
    Reviewed-by: Bart Van Assche <bvanassche@acm.org>
    Fixes: 04b5b5cb0136 ("scsi: core: Fix possible memory leak if device_add() fails")
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2684b97b01eb221b475f5a8229cb6466bd775ddc
Author: Neil Armstrong <neil.armstrong@linaro.org>
Date:   Mon Aug 21 14:11:21 2023 +0200

    scsi: ufs: ufs-qcom: Clear qunipro_g4_sel for HW major version > 5
    
    commit c422fbd5cb58c9a078172ae1e9750971b738a197 upstream.
    
    The qunipro_g4_sel clear is also needed for new platforms with major
    version > 5. Fix the version check to take this into account.
    
    Fixes: 9c02aa24bf40 ("scsi: ufs: ufs-qcom: Clear qunipro_g4_sel for HW version major 5")
    Acked-by: Manivannan Sadhasivam <mani@kernel.org>
    Reviewed-by: Nitin Rawat <quic_nitirawa@quicinc.com>
    Signed-off-by: Neil Armstrong <neil.armstrong@linaro.org>
    Link: https://lore.kernel.org/r/20230821-topic-sm8x50-upstream-ufs-major-5-plus-v2-1-f42a4b712e58@linaro.org
    Reviewed-by: "Bao D. Nguyen" <quic_nguyenb@quicinc.com>
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 43dc0a70ed1e21960c28ea0406680af63a93609d
Author: Zhu Wang <wangzhu9@huawei.com>
Date:   Sat Aug 19 08:39:41 2023 +0000

    scsi: snic: Fix double free in snic_tgt_create()
    
    commit 1bd3a76880b2bce017987cf53780b372cf59528e upstream.
    
    Commit 41320b18a0e0 ("scsi: snic: Fix possible memory leak if device_add()
    fails") fixed the memory leak caused by dev_set_name() when device_add()
    failed. However, it did not consider that 'tgt' has already been released
    when put_device(&tgt->dev) is called. Remove kfree(tgt) in the error path
    to avoid double free of 'tgt' and move put_device(&tgt->dev) after the
    removed kfree(tgt) to avoid a use-after-free.
    
    Fixes: 41320b18a0e0 ("scsi: snic: Fix possible memory leak if device_add() fails")
    Signed-off-by: Zhu Wang <wangzhu9@huawei.com>
    Link: https://lore.kernel.org/r/20230819083941.164365-1-wangzhu9@huawei.com
    Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 54fce635ee7f4acaf1782f68bc0889340ef027a3
Author: Yin Fengwei <fengwei.yin@intel.com>
Date:   Tue Aug 8 10:09:17 2023 +0800

    madvise:madvise_free_pte_range(): don't use mapcount() against large folio for sharing check
    
    commit 0e0e9bd5f7b9d40fd03b70092367247d52da1db0 upstream.
    
    Commit 98b211d6415f ("madvise: convert madvise_free_pte_range() to use a
    folio") replaced the page_mapcount() with folio_mapcount() to check
    whether the folio is shared by other mapping.
    
    It's not correct for large folios. folio_mapcount() returns the total
    mapcount of large folio which is not suitable to detect whether the folio
    is shared.
    
    Use folio_estimated_sharers() which returns a estimated number of shares.
    That means it's not 100% correct. It should be OK for madvise case here.
    
    User-visible effects is that the THP is skipped when user call madvise.
    But the correct behavior is THP should be split and processed then.
    
    NOTE: this change is a temporary fix to reduce the user-visible effects
    before the long term fix from David is ready.
    
    Link: https://lkml.kernel.org/r/20230808020917.2230692-4-fengwei.yin@intel.com
    Fixes: 98b211d6415f ("madvise: convert madvise_free_pte_range() to use a folio")
    Signed-off-by: Yin Fengwei <fengwei.yin@intel.com>
    Reviewed-by: Yu Zhao <yuzhao@google.com>
    Reviewed-by: Ryan Roberts <ryan.roberts@arm.com>
    Cc: David Hildenbrand <david@redhat.com>
    Cc: Kefeng Wang <wangkefeng.wang@huawei.com>
    Cc: Matthew Wilcox <willy@infradead.org>
    Cc: Minchan Kim <minchan@kernel.org>
    Cc: Vishal Moola (Oracle) <vishal.moola@gmail.com>
    Cc: Yang Shi <shy828301@gmail.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 63f23017751069830daa715d6c85bc7e56e40538
Author: Yin Fengwei <fengwei.yin@intel.com>
Date:   Tue Aug 8 10:09:15 2023 +0800

    madvise:madvise_cold_or_pageout_pte_range(): don't use mapcount() against large folio for sharing check
    
    commit 2f406263e3e954aa24c1248edcfa9be0c1bb30fa upstream.
    
    Patch series "don't use mapcount() to check large folio sharing", v2.
    
    In madvise_cold_or_pageout_pte_range() and madvise_free_pte_range(),
    folio_mapcount() is used to check whether the folio is shared.  But it's
    not correct as folio_mapcount() returns total mapcount of large folio.
    
    Use folio_estimated_sharers() here as the estimated number is enough.
    
    This patchset will fix the cases:
    User space application call madvise() with MADV_FREE, MADV_COLD and
    MADV_PAGEOUT for specific address range. There are THP mapped to the
    range. Without the patchset, the THP is skipped. With the patch, the
    THP will be split and handled accordingly.
    
    David reported the cow self test skip some cases because of MADV_PAGEOUT
    skip THP:
    https://lore.kernel.org/linux-mm/9e92e42d-488f-47db-ac9d-75b24cd0d037@intel.com/T/#mbf0f2ec7fbe45da47526de1d7036183981691e81
    and I confirmed this patchset make it work again.
    
    
    This patch (of 3):
    
    Commit 07e8c82b5eff ("madvise: convert madvise_cold_or_pageout_pte_range()
    to use folios") replaced the page_mapcount() with folio_mapcount() to
    check whether the folio is shared by other mapping.
    
    It's not correct for large folio.  folio_mapcount() returns the total
    mapcount of large folio which is not suitable to detect whether the folio
    is shared.
    
    Use folio_estimated_sharers() which returns a estimated number of shares.
    That means it's not 100% correct.  It should be OK for madvise case here.
    
    User-visible effects is that the THP is skipped when user call madvise.
    But the correct behavior is THP should be split and processed then.
    
    NOTE: this change is a temporary fix to reduce the user-visible effects
    before the long term fix from David is ready.
    
    Link: https://lkml.kernel.org/r/20230808020917.2230692-1-fengwei.yin@intel.com
    Link: https://lkml.kernel.org/r/20230808020917.2230692-2-fengwei.yin@intel.com
    Fixes: 07e8c82b5eff ("madvise: convert madvise_cold_or_pageout_pte_range() to use folios")
    Signed-off-by: Yin Fengwei <fengwei.yin@intel.com>
    Reviewed-by: Yu Zhao <yuzhao@google.com>
    Reviewed-by: Ryan Roberts <ryan.roberts@arm.com>
    Cc: David Hildenbrand <david@redhat.com>
    Cc: Kefeng Wang <wangkefeng.wang@huawei.com>
    Cc: Matthew Wilcox <willy@infradead.org>
    Cc: Minchan Kim <minchan@kernel.org>
    Cc: Vishal Moola (Oracle) <vishal.moola@gmail.com>
    Cc: Yang Shi <shy828301@gmail.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 26a2b7cec0ddca03014d0fdedd08f5c6b69869e0
Author: Matt Roper <matthew.d.roper@intel.com>
Date:   Thu Jun 1 10:38:04 2023 -0700

    drm/i915: Fix error handling if driver creation fails during probe
    
    commit 718551bbed3ca5308a9f9429305dd074727e8d46 upstream.
    
    If i915_driver_create() fails to create a valid 'i915' object, we
    should just disable the PCI device and return immediately without trying
    to call i915_probe_error() that relies on a valid i915 pointer.
    
    Fixes: 12e6f6dc78e4 ("drm/i915/display: Handle GMD_ID identification in display code")
    Reported-by: Dan Carpenter <dan.carpenter@linaro.org>
    Closes: https://lore.kernel.org/all/55236f93-dcc5-481e-b788-9f7e95b129d8@kili.mountain/
    Signed-off-by: Matt Roper <matthew.d.roper@intel.com>
    Reviewed-by: Gustavo Sousa <gustavo.sousa@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230601173804.557756-1-matthew.d.roper@intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 0a47ffcac3c529f68c414a9245712aa0886d4a9a
Author: Oliver Hartkopp <socketcan@hartkopp.net>
Date:   Mon Aug 21 16:45:47 2023 +0200

    can: raw: add missing refcount for memory leak fix
    
    commit c275a176e4b69868576e543409927ae75e3a3288 upstream.
    
    Commit ee8b94c8510c ("can: raw: fix receiver memory leak") introduced
    a new reference to the CAN netdevice that has assigned CAN filters.
    But this new ro->dev reference did not maintain its own refcount which
    lead to another KASAN use-after-free splat found by Eric Dumazet.
    
    This patch ensures a proper refcount for the CAN nedevice.
    
    Fixes: ee8b94c8510c ("can: raw: fix receiver memory leak")
    Reported-by: Eric Dumazet <edumazet@google.com>
    Cc: Ziyang Xuan <william.xuanziyang@huawei.com>
    Signed-off-by: Oliver Hartkopp <socketcan@hartkopp.net>
    Link: https://lore.kernel.org/r/20230821144547.6658-3-socketcan@hartkopp.net
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 06614ca4f18ed90ef7ad2e98fb234f88e2f72f61
Author: Sanjay R Mehta <sanju.mehta@amd.com>
Date:   Wed Aug 2 06:11:49 2023 -0500

    thunderbolt: Fix Thunderbolt 3 display flickering issue on 2nd hot plug onwards
    
    commit 583893a66d731f5da010a3fa38a0460e05f0149b upstream.
    
    Previously, on unplug events, the TMU mode was disabled first
    followed by the Time Synchronization Handshake, irrespective of
    whether the tb_switch_tmu_rate_write() API was successful or not.
    
    However, this caused a problem with Thunderbolt 3 (TBT3)
    devices, as the TSPacketInterval bits were always enabled by default,
    leading the host router to assume that the device router's TMU was
    already enabled and preventing it from initiating the Time
    Synchronization Handshake. As a result, TBT3 monitors experienced
    display flickering from the second hot plug onwards.
    
    To address this issue, we have modified the code to only disable the
    Time Synchronization Handshake during TMU disable if the
    tb_switch_tmu_rate_write() function is successful. This ensures that
    the TBT3 devices function correctly and eliminates the display
    flickering issue.
    
    Co-developed-by: Sanath S <Sanath.S@amd.com>
    Signed-off-by: Sanath S <Sanath.S@amd.com>
    Signed-off-by: Sanjay R Mehta <sanju.mehta@amd.com>
    Cc: stable@vger.kernel.org
    Signed-off-by: Mika Westerberg <mika.westerberg@linux.intel.com>
    [ USB4v2 introduced support for uni-directional TMU mode as part of
      d49b4f043d63 ("thunderbolt: Add support for enhanced uni-directional TMU mode")
      This is not a stable candidate commit, so adjust the code for backport. ]
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit efe4d998330a31db23212bca2e242a1e91cf246f
Author: Igor Mammedov <imammedo@redhat.com>
Date:   Wed Jul 26 14:35:18 2023 +0200

    PCI: acpiphp: Use pci_assign_unassigned_bridge_resources() only for non-root bus
    
    commit cc22522fd55e257c86d340ae9aedc122e705a435 upstream.
    
    40613da52b13 ("PCI: acpiphp: Reassign resources on bridge if necessary")
    changed acpiphp hotplug to use pci_assign_unassigned_bridge_resources()
    which depends on bridge being available, however enable_slot() can be
    called without bridge associated:
    
      1. Legitimate case of hotplug on root bus (widely used in virt world)
    
      2. A (misbehaving) firmware, that sends ACPI Bus Check notifications to
         non existing root ports (Dell Inspiron 7352/0W6WV0), which end up at
         enable_slot(..., bridge = 0) where bus has no bridge assigned to it.
         acpihp doesn't know that it's a bridge, and bus specific 'PCI
         subsystem' can't augment ACPI context with bridge information since
         the PCI device to get this data from is/was not available.
    
    Issue is easy to reproduce with QEMU's 'pc' machine, which supports PCI
    hotplug on hostbridge slots. To reproduce, boot kernel at commit
    40613da52b13 in VM started with following CLI (assuming guest root fs is
    installed on sda1 partition):
    
      # qemu-system-x86_64 -M pc -m 1G -enable-kvm -cpu host \
            -monitor stdio -serial file:serial.log           \
            -kernel arch/x86/boot/bzImage                    \
            -append "root=/dev/sda1 console=ttyS0"           \
            guest_disk.img
    
    Once guest OS is fully booted at qemu prompt:
    
      (qemu) device_add e1000
    
    (check serial.log) it will cause NULL pointer dereference at:
    
      void pci_assign_unassigned_bridge_resources(struct pci_dev *bridge)
      {
        struct pci_bus *parent = bridge->subordinate;
    
      BUG: kernel NULL pointer dereference, address: 0000000000000018
    
       ? pci_assign_unassigned_bridge_resources+0x1f/0x260
       enable_slot+0x21f/0x3e0
       acpiphp_hotplug_notify+0x13d/0x260
       acpi_device_hotplug+0xbc/0x540
       acpi_hotplug_work_fn+0x15/0x20
       process_one_work+0x1f7/0x370
       worker_thread+0x45/0x3b0
    
    The issue was discovered on Dell Inspiron 7352/0W6WV0 laptop with following
    sequence:
    
      1. Suspend to RAM
      2. Wake up with the same backtrace being observed:
      3. 2nd suspend to RAM attempt makes laptop freeze
    
    Fix it by using __pci_bus_assign_resources() instead of
    pci_assign_unassigned_bridge_resources() as we used to do, but only in case
    when bus doesn't have a bridge associated (to cover for the case of ACPI
    event on hostbridge or non existing root port).
    
    That lets us keep hotplug on root bus working like it used to and at the
    same time keeps resource reassignment usable on root ports (and other 1st
    level bridges) that was fixed by 40613da52b13.
    
    Fixes: 40613da52b13 ("PCI: acpiphp: Reassign resources on bridge if necessary")
    Link: https://lore.kernel.org/r/20230726123518.2361181-2-imammedo@redhat.com
    Reported-by: Woody Suwalski <terraluna977@gmail.com>
    Tested-by: Woody Suwalski <terraluna977@gmail.com>
    Tested-by: Michal Koutný <mkoutny@suse.com>
    Link: https://lore.kernel.org/r/11fc981c-af49-ce64-6b43-3e282728bd1a@gmail.com
    Signed-off-by: Igor Mammedov <imammedo@redhat.com>
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    Acked-by: Rafael J. Wysocki <rafael@kernel.org>
    Acked-by: Michael S. Tsirkin <mst@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit f80b4b818e5e620942f70d2ebf32e0831560e040
Author: Wei Chen <harperchen1110@gmail.com>
Date:   Thu Aug 10 08:23:33 2023 +0000

    media: vcodec: Fix potential array out-of-bounds in encoder queue_setup
    
    commit e7f2e65699e2290fd547ec12a17008764e5d9620 upstream.
    
    variable *nplanes is provided by user via system call argument. The
    possible value of q_data->fmt->num_planes is 1-3, while the value
    of *nplanes can be 1-8. The array access by index i can cause array
    out-of-bounds.
    
    Fix this bug by checking *nplanes against the array size.
    
    Fixes: 4e855a6efa54 ("[media] vcodec: mediatek: Add Mediatek V4L2 Video Encoder Driver")
    Signed-off-by: Wei Chen <harperchen1110@gmail.com>
    Cc: stable@vger.kernel.org
    Reviewed-by: Chen-Yu Tsai <wenst@chromium.org>
    Signed-off-by: Hans Verkuil <hverkuil-cisco@xs4all.nl>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 423e75565761e9bd637766e31044a70dde16af69
Author: Mario Limonciello <mario.limonciello@amd.com>
Date:   Fri Aug 18 09:48:50 2023 -0500

    pinctrl: amd: Mask wake bits on probe again
    
    commit 6bc3462a0f5ecaa376a0b3d76dafc55796799e17 upstream.
    
    Shubhra reports that their laptop is heating up over s2idle. Even though
    it's getting into the deepest state, it appears to be having spurious
    wakeup events.
    
    While debugging a tangential issue with the RTC Carsten reports that recent
    6.1.y based kernel face a similar problem.
    
    Looking at acpidump and GPIO register comparisons these spurious wakeup
    events are from the GPIO associated with the I2C touchpad on both laptops
    and occur even when the touchpad is not marked as a wake source by the
    kernel.
    
    This means that the boot firmware has programmed these bits and because
    Linux didn't touch them lead to spurious wakeup events from that GPIO.
    
    To fix this issue, restore most of the code that previously would clear all
    the bits associated with wakeup sources. This will allow the kernel to only
    program the wake up sources that are necessary.
    
    This is similar to what was done previously; but only the wake bits are
    cleared by default instead of interrupts and wake bits.  If any other
    problems are reported then it may make sense to clear interrupts again too.
    
    Cc: Sachi King <nakato@nakato.io>
    Cc: stable@vger.kernel.org
    Cc: Thorsten Leemhuis <regressions@leemhuis.info>
    Fixes: 65f6c7c91cb2 ("pinctrl: amd: Revert "pinctrl: amd: disable and mask interrupts on probe"")
    Reported-by: Shubhra Prakash Nandi <email2shubhra@gmail.com>
    Closes: https://bugzilla.kernel.org/show_bug.cgi?id=217754
    Reported-by: Carsten Hatger <xmb8dsv4@gmail.com>
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=217626#c28
    Signed-off-by: Mario Limonciello <mario.limonciello@amd.com>
    Link: https://lore.kernel.org/r/20230818144850.1439-1-mario.limonciello@amd.com
    Signed-off-by: Linus Walleij <linus.walleij@linaro.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit ce2e8904a81727ad8fab3c0369a4e446838fb78a
Author: Rob Herring <robh@kernel.org>
Date:   Fri Aug 18 15:40:57 2023 -0500

    of: dynamic: Refactor action prints to not use "%pOF" inside devtree_lock
    
    commit 914d9d831e6126a6e7a92e27fcfaa250671be42c upstream.
    
    While originally it was fine to format strings using "%pOF" while
    holding devtree_lock, this now causes a deadlock.  Lockdep reports:
    
        of_get_parent from of_fwnode_get_parent+0x18/0x24
        ^^^^^^^^^^^^^
        of_fwnode_get_parent from fwnode_count_parents+0xc/0x28
        fwnode_count_parents from fwnode_full_name_string+0x18/0xac
        fwnode_full_name_string from device_node_string+0x1a0/0x404
        device_node_string from pointer+0x3c0/0x534
        pointer from vsnprintf+0x248/0x36c
        vsnprintf from vprintk_store+0x130/0x3b4
    
    Fix this by moving the printing in __of_changeset_entry_apply() outside
    the lock. As the only difference in the multiple prints is the action
    name, use the existing "action_names" to refactor the prints into a
    single print.
    
    Fixes: a92eb7621b9fb2c2 ("lib/vsprintf: Make use of fwnode API to obtain node names and separators")
    Cc: stable@vger.kernel.org
    Reported-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/r/20230801-dt-changeset-fixes-v3-2-5f0410e007dd@kernel.org
    Signed-off-by: Rob Herring <robh@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit d92815c542d7985e4d91a100070e63294dee98bc
Author: Rob Herring <robh@kernel.org>
Date:   Fri Aug 18 15:40:56 2023 -0500

    of: unittest: Fix EXPECT for parse_phandle_with_args_map() test
    
    commit 0aeae3788e28f64ccb95405d4dc8cd80637ffaea upstream.
    
    Commit 12e17243d8a1 ("of: base: improve error msg in
    of_phandle_iterator_next()") added printing of the phandle value on
    error, but failed to update the unittest.
    
    Fixes: 12e17243d8a1 ("of: base: improve error msg in of_phandle_iterator_next()")
    Cc: stable@vger.kernel.org
    Reviewed-by: Geert Uytterhoeven <geert+renesas@glider.be>
    Link: https://lore.kernel.org/r/20230801-dt-changeset-fixes-v3-1-5f0410e007dd@kernel.org
    Signed-off-by: Rob Herring <robh@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit fa700d9cda9a0229ec234dbff4cc75abc3fbaf92
Author: Arnd Bergmann <arnd@arndb.de>
Date:   Fri Aug 11 15:10:13 2023 +0200

    radix tree: remove unused variable
    
    commit d59070d1076ec5114edb67c87658aeb1d691d381 upstream.
    
    Recent versions of clang warn about an unused variable, though older
    versions saw the 'slot++' as a use and did not warn:
    
    radix-tree.c:1136:50: error: parameter 'slot' set but not used [-Werror,-Wunused-but-set-parameter]
    
    It's clearly not needed any more, so just remove it.
    
    Link: https://lkml.kernel.org/r/20230811131023.2226509-1-arnd@kernel.org
    Fixes: 3a08cd52c37c7 ("radix tree: Remove multiorder support")
    Signed-off-by: Arnd Bergmann <arnd@arndb.de>
    Cc: Matthew Wilcox <willy@infradead.org>
    Cc: Nathan Chancellor <nathan@kernel.org>
    Cc: Nick Desaulniers <ndesaulniers@google.com>
    Cc: Peng Zhang <zhangpeng.00@bytedance.com>
    Cc: Rong Tao <rongtao@cestc.cn>
    Cc: Tom Rix <trix@redhat.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 8f6813c62d2f5cf6fb5d91671f4c97ba76075dce
Author: Mingzheng Xing <xingmingzheng@iscas.ac.cn>
Date:   Fri Aug 25 03:08:52 2023 +0800

    riscv: Fix build errors using binutils2.37 toolchains
    
    commit ef21fa7c198e04f3d3053b1c5b5f2b4b225c3350 upstream.
    
    When building the kernel with binutils 2.37 and GCC-11.1.0/GCC-11.2.0,
    the following error occurs:
    
      Assembler messages:
      Error: cannot find default versions of the ISA extension `zicsr'
      Error: cannot find default versions of the ISA extension `zifencei'
    
    The above error originated from this commit of binutils[0], which has been
    resolved and backported by GCC-12.1.0[1] and GCC-11.3.0[2].
    
    So fix this by change the GCC version in
    CONFIG_TOOLCHAIN_NEEDS_OLD_ISA_SPEC to GCC-11.3.0.
    
    Link: https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=f0bae2552db1dd4f1995608fbf6648fcee4e9e0c [0]
    Link: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=ca2bbb88f999f4d3cc40e89bc1aba712505dd598 [1]
    Link: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=d29f5d6ab513c52fd872f532c492e35ae9fd6671 [2]
    Fixes: ca09f772ccca ("riscv: Handle zicsr/zifencei issue between gcc and binutils")
    Reported-by: Conor Dooley <conor.dooley@microchip.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Mingzheng Xing <xingmingzheng@iscas.ac.cn>
    Link: https://lore.kernel.org/r/20230824190852.45470-1-xingmingzheng@iscas.ac.cn
    Closes: https://lore.kernel.org/all/20230823-captive-abdomen-befd942a4a73@wendy/
    Reviewed-by: Conor Dooley <conor.dooley@microchip.com>
    Tested-by: Conor Dooley <conor.dooley@microchip.com>
    Signed-off-by: Palmer Dabbelt <palmer@rivosinc.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 1b7ac88ef2e4e205a32d139e81e42d51d84bf6ed
Author: Mingzheng Xing <xingmingzheng@iscas.ac.cn>
Date:   Thu Aug 10 00:56:48 2023 +0800

    riscv: Handle zicsr/zifencei issue between gcc and binutils
    
    commit ca09f772cccaeec4cd05a21528c37a260aa2dd2c upstream.
    
    Binutils-2.38 and GCC-12.1.0 bumped[0][1] the default ISA spec to the newer
    20191213 version which moves some instructions from the I extension to the
    Zicsr and Zifencei extensions. So if one of the binutils and GCC exceeds
    that version, we should explicitly specifying Zicsr and Zifencei via -march
    to cope with the new changes. but this only occurs when binutils >= 2.36
    and GCC >= 11.1.0. It's a different story when binutils < 2.36.
    
    binutils-2.36 supports the Zifencei extension[2] and splits Zifencei and
    Zicsr from I[3]. GCC-11.1.0 is particular[4] because it add support Zicsr
    and Zifencei extension for -march. binutils-2.35 does not support the
    Zifencei extension, and does not need to specify Zicsr and Zifencei when
    working with GCC >= 12.1.0.
    
    To make our lives easier, let's relax the check to binutils >= 2.36 in
    CONFIG_TOOLCHAIN_NEEDS_EXPLICIT_ZICSR_ZIFENCEI. For the other two cases,
    where clang < 17 or GCC < 11.1.0, we will deal with them in
    CONFIG_TOOLCHAIN_NEEDS_OLD_ISA_SPEC.
    
    For more information, please refer to:
    commit 6df2a016c0c8 ("riscv: fix build with binutils 2.38")
    commit e89c2e815e76 ("riscv: Handle zicsr/zifencei issues between clang and binutils")
    
    Link: https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=aed44286efa8ae8717a77d94b51ac3614e2ca6dc [0]
    Link: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=98416dbb0a62579d4a7a4a76bab51b5b52fec2cd [1]
    Link: https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=5a1b31e1e1cee6e9f1c92abff59cdcfff0dddf30 [2]
    Link: https://sourceware.org/git/?p=binutils-gdb.git;a=commit;h=729a53530e86972d1143553a415db34e6e01d5d2 [3]
    Link: https://gcc.gnu.org/git/?p=gcc.git;a=commit;h=b03be74bad08c382da47e048007a78fa3fb4ef49 [4]
    Link: https://lore.kernel.org/all/20230308220842.1231003-1-conor@kernel.org
    Link: https://lore.kernel.org/all/20230223220546.52879-1-conor@kernel.org
    Reviewed-by: Conor Dooley <conor.dooley@microchip.com>
    Acked-by: Guo Ren <guoren@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Mingzheng Xing <xingmingzheng@iscas.ac.cn>
    Link: https://lore.kernel.org/r/20230809165648.21071-1-xingmingzheng@iscas.ac.cn
    Signed-off-by: Palmer Dabbelt <palmer@rivosinc.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 5039e4afc05012f218a2631b269f4b8c7bc97d28
Author: Helge Deller <deller@gmx.de>
Date:   Fri Aug 25 21:50:33 2023 +0200

    lib/clz_ctz.c: Fix __clzdi2() and __ctzdi2() for 32-bit kernels
    
    commit 382d4cd1847517ffcb1800fd462b625db7b2ebea upstream.
    
    The gcc compiler translates on some architectures the 64-bit
    __builtin_clzll() function to a call to the libgcc function __clzdi2(),
    which should take a 64-bit parameter on 32- and 64-bit platforms.
    
    But in the current kernel code, the built-in __clzdi2() function is
    defined to operate (wrongly) on 32-bit parameters if BITS_PER_LONG ==
    32, thus the return values on 32-bit kernels are in the range from
    [0..31] instead of the expected [0..63] range.
    
    This patch fixes the in-kernel functions __clzdi2() and __ctzdi2() to
    take a 64-bit parameter on 32-bit kernels as well, thus it makes the
    functions identical for 32- and 64-bit kernels.
    
    This bug went unnoticed since kernel 3.11 for over 10 years, and here
    are some possible reasons for that:
    
     a) Some architectures have assembly instructions to count the bits and
        which are used instead of calling __clzdi2(), e.g. on x86 the bsr
        instruction and on ppc cntlz is used. On such architectures the
        wrong __clzdi2() implementation isn't used and as such the bug has
        no effect and won't be noticed.
    
     b) Some architectures link to libgcc.a, and the in-kernel weak
        functions get replaced by the correct 64-bit variants from libgcc.a.
    
     c) __builtin_clzll() and __clzdi2() doesn't seem to be used in many
        places in the kernel, and most likely only in uncritical functions,
        e.g. when printing hex values via seq_put_hex_ll(). The wrong return
        value will still print the correct number, but just in a wrong
        formatting (e.g. with too many leading zeroes).
    
     d) 32-bit kernels aren't used that much any longer, so they are less
        tested.
    
    A trivial testcase to verify if the currently running 32-bit kernel is
    affected by the bug is to look at the output of /proc/self/maps:
    
    Here the kernel uses a correct implementation of __clzdi2():
    
      root@debian:~# cat /proc/self/maps
      00010000-00019000 r-xp 00000000 08:05 787324     /usr/bin/cat
      00019000-0001a000 rwxp 00009000 08:05 787324     /usr/bin/cat
      0001a000-0003b000 rwxp 00000000 00:00 0          [heap]
      f7551000-f770d000 r-xp 00000000 08:05 794765     /usr/lib/hppa-linux-gnu/libc.so.6
      ...
    
    and this kernel uses the broken implementation of __clzdi2():
    
      root@debian:~# cat /proc/self/maps
      0000000010000-0000000019000 r-xp 00000000 000000008:000000005 787324  /usr/bin/cat
      0000000019000-000000001a000 rwxp 000000009000 000000008:000000005 787324  /usr/bin/cat
      000000001a000-000000003b000 rwxp 00000000 00:00 0  [heap]
      00000000f73d1000-00000000f758d000 r-xp 00000000 000000008:000000005 794765  /usr/lib/hppa-linux-gnu/libc.so.6
      ...
    
    Signed-off-by: Helge Deller <deller@gmx.de>
    Fixes: 4df87bb7b6a22 ("lib: add weak clz/ctz functions")
    Cc: Chanho Min <chanho.min@lge.com>
    Cc: Geert Uytterhoeven <geert@linux-m68k.org>
    Cc: stable@vger.kernel.org # v3.11+
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit d64a94bc6ef77c0873d2f54b795619b6829a3a6a
Author: Hans de Goede <hdegoede@redhat.com>
Date:   Mon Aug 21 11:09:27 2023 +0200

    ACPI: resource: Fix IRQ override quirk for PCSpecialist Elimina Pro 16 M
    
    commit 453b014e2c294abf762d3bce12e91ce4b34055e6 upstream.
    
    It turns out that some PCSpecialist Elimina Pro 16 M models
    have "GM6BGEQ" as DMI product-name instead of "Elimina Pro 16 M",
    causing the existing DMI quirk to not work on these models.
    
    The DMI board-name is always "GM6BGEQ", so match on that instead.
    
    Fixes: 56fec0051a69 ("ACPI: resource: Add IRQ override quirk for PCSpecialist Elimina Pro 16 M")
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=217394#c36
    Cc: All applicable <stable@vger.kernel.org>
    Signed-off-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 4cb9ace298f30b7f9681fe04a124032516feb6fb
Author: Sven Eckelmann <sven@narfation.org>
Date:   Mon Aug 21 21:48:48 2023 +0200

    batman-adv: Hold rtnl lock during MTU update via netlink
    
    commit 987aae75fc1041072941ffb622b45ce2359a99b9 upstream.
    
    The automatic recalculation of the maximum allowed MTU is usually triggered
    by code sections which are already rtnl lock protected by callers outside
    of batman-adv. But when the fragmentation setting is changed via
    batman-adv's own batadv genl family, then the rtnl lock is not yet taken.
    
    But dev_set_mtu requires that the caller holds the rtnl lock because it
    uses netdevice notifiers. And this code will then fail the check for this
    lock:
    
      RTNL: assertion failed at net/core/dev.c (1953)
    
    Cc: stable@vger.kernel.org
    Reported-by: syzbot+f8812454d9b3ac00d282@syzkaller.appspotmail.com
    Fixes: c6a953cce8d0 ("batman-adv: Trigger events for auto adjusted MTU")
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/20230821-batadv-missing-mtu-rtnl-lock-v1-1-1c5a7bfe861e@narfation.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 55d18e4b2bfe52fb622bd8db1913744c9df117d1
Author: Remi Pommarel <repk@triplefau.lt>
Date:   Wed Aug 9 17:29:13 2023 +0200

    batman-adv: Fix batadv_v_ogm_aggr_send memory leak
    
    commit 421d467dc2d483175bad4fb76a31b9e5a3d744cf upstream.
    
    When batadv_v_ogm_aggr_send is called for an inactive interface, the skb
    is silently dropped by batadv_v_ogm_send_to_if() but never freed causing
    the following memory leak:
    
      unreferenced object 0xffff00000c164800 (size 512):
        comm "kworker/u8:1", pid 2648, jiffies 4295122303 (age 97.656s)
        hex dump (first 32 bytes):
          00 80 af 09 00 00 ff ff e1 09 00 00 75 01 60 83  ............u.`.
          1f 00 00 00 b8 00 00 00 15 00 05 00 da e3 d3 64  ...............d
        backtrace:
          [<0000000007ad20f6>] __kmalloc_track_caller+0x1a8/0x310
          [<00000000d1029e55>] kmalloc_reserve.constprop.0+0x70/0x13c
          [<000000008b9d4183>] __alloc_skb+0xec/0x1fc
          [<00000000c7af5051>] __netdev_alloc_skb+0x48/0x23c
          [<00000000642ee5f5>] batadv_v_ogm_aggr_send+0x50/0x36c
          [<0000000088660bd7>] batadv_v_ogm_aggr_work+0x24/0x40
          [<0000000042fc2606>] process_one_work+0x3b0/0x610
          [<000000002f2a0b1c>] worker_thread+0xa0/0x690
          [<0000000059fae5d4>] kthread+0x1fc/0x210
          [<000000000c587d3a>] ret_from_fork+0x10/0x20
    
    Free the skb in that case to fix this leak.
    
    Cc: stable@vger.kernel.org
    Fixes: 0da0035942d4 ("batman-adv: OGMv2 - add basic infrastructure")
    Signed-off-by: Remi Pommarel <repk@triplefau.lt>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Simon Wunderlich <sw@simonwunderlich.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit e265dca9ba59fca2dceee3b8a11383636e3ed65a
Author: Remi Pommarel <repk@triplefau.lt>
Date:   Fri Aug 4 11:39:36 2023 +0200

    batman-adv: Fix TT global entry leak when client roamed back
    
    commit d25ddb7e788d34cf27ff1738d11a87cb4b67d446 upstream.
    
    When a client roamed back to a node before it got time to destroy the
    pending local entry (i.e. within the same originator interval) the old
    global one is directly removed from hash table and left as such.
    
    But because this entry had an extra reference taken at lookup (i.e using
    batadv_tt_global_hash_find) there is no way its memory will be reclaimed
    at any time causing the following memory leak:
    
      unreferenced object 0xffff0000073c8000 (size 18560):
        comm "softirq", pid 0, jiffies 4294907738 (age 228.644s)
        hex dump (first 32 bytes):
          06 31 ac 12 c7 7a 05 00 01 00 00 00 00 00 00 00  .1...z..........
          2c ad be 08 00 80 ff ff 6c b6 be 08 00 80 ff ff  ,.......l.......
        backtrace:
          [<00000000ee6e0ffa>] kmem_cache_alloc+0x1b4/0x300
          [<000000000ff2fdbc>] batadv_tt_global_add+0x700/0xe20
          [<00000000443897c7>] _batadv_tt_update_changes+0x21c/0x790
          [<000000005dd90463>] batadv_tt_update_changes+0x3c/0x110
          [<00000000a2d7fc57>] batadv_tt_tvlv_unicast_handler_v1+0xafc/0xe10
          [<0000000011793f2a>] batadv_tvlv_containers_process+0x168/0x2b0
          [<00000000b7cbe2ef>] batadv_recv_unicast_tvlv+0xec/0x1f4
          [<0000000042aef1d8>] batadv_batman_skb_recv+0x25c/0x3a0
          [<00000000bbd8b0a2>] __netif_receive_skb_core.isra.0+0x7a8/0xe90
          [<000000004033d428>] __netif_receive_skb_one_core+0x64/0x74
          [<000000000f39a009>] __netif_receive_skb+0x48/0xe0
          [<00000000f2cd8888>] process_backlog+0x174/0x344
          [<00000000507d6564>] __napi_poll+0x58/0x1f4
          [<00000000b64ef9eb>] net_rx_action+0x504/0x590
          [<00000000056fa5e4>] _stext+0x1b8/0x418
          [<00000000878879d6>] run_ksoftirqd+0x74/0xa4
      unreferenced object 0xffff00000bae1a80 (size 56):
        comm "softirq", pid 0, jiffies 4294910888 (age 216.092s)
        hex dump (first 32 bytes):
          00 78 b1 0b 00 00 ff ff 0d 50 00 00 00 00 00 00  .x.......P......
          00 00 00 00 00 00 00 00 50 c8 3c 07 00 00 ff ff  ........P.<.....
        backtrace:
          [<00000000ee6e0ffa>] kmem_cache_alloc+0x1b4/0x300
          [<00000000d9aaa49e>] batadv_tt_global_add+0x53c/0xe20
          [<00000000443897c7>] _batadv_tt_update_changes+0x21c/0x790
          [<000000005dd90463>] batadv_tt_update_changes+0x3c/0x110
          [<00000000a2d7fc57>] batadv_tt_tvlv_unicast_handler_v1+0xafc/0xe10
          [<0000000011793f2a>] batadv_tvlv_containers_process+0x168/0x2b0
          [<00000000b7cbe2ef>] batadv_recv_unicast_tvlv+0xec/0x1f4
          [<0000000042aef1d8>] batadv_batman_skb_recv+0x25c/0x3a0
          [<00000000bbd8b0a2>] __netif_receive_skb_core.isra.0+0x7a8/0xe90
          [<000000004033d428>] __netif_receive_skb_one_core+0x64/0x74
          [<000000000f39a009>] __netif_receive_skb+0x48/0xe0
          [<00000000f2cd8888>] process_backlog+0x174/0x344
          [<00000000507d6564>] __napi_poll+0x58/0x1f4
          [<00000000b64ef9eb>] net_rx_action+0x504/0x590
          [<00000000056fa5e4>] _stext+0x1b8/0x418
          [<00000000878879d6>] run_ksoftirqd+0x74/0xa4
    
    Releasing the extra reference from batadv_tt_global_hash_find even at
    roam back when batadv_tt_global_free is called fixes this memory leak.
    
    Cc: stable@vger.kernel.org
    Fixes: 068ee6e204e1 ("batman-adv: roaming handling mechanism redesign")
    Signed-off-by: Remi Pommarel <repk@triplefau.lt>
    Signed-off-by; Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Simon Wunderlich <sw@simonwunderlich.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 5effaa05704aadeb888e9bd73c4dd458c3e09fc8
Author: Remi Pommarel <repk@triplefau.lt>
Date:   Fri Jul 28 15:38:50 2023 +0200

    batman-adv: Do not get eth header before batadv_check_management_packet
    
    commit eac27a41ab641de074655d2932fc7f8cdb446881 upstream.
    
    If received skb in batadv_v_elp_packet_recv or batadv_v_ogm_packet_recv
    is either cloned or non linearized then its data buffer will be
    reallocated by batadv_check_management_packet when skb_cow or
    skb_linearize get called. Thus geting ethernet header address inside
    skb data buffer before batadv_check_management_packet had any chance to
    reallocate it could lead to the following kernel panic:
    
      Unable to handle kernel paging request at virtual address ffffff8020ab069a
      Mem abort info:
        ESR = 0x96000007
        EC = 0x25: DABT (current EL), IL = 32 bits
        SET = 0, FnV = 0
        EA = 0, S1PTW = 0
        FSC = 0x07: level 3 translation fault
      Data abort info:
        ISV = 0, ISS = 0x00000007
        CM = 0, WnR = 0
      swapper pgtable: 4k pages, 39-bit VAs, pgdp=0000000040f45000
      [ffffff8020ab069a] pgd=180000007fffa003, p4d=180000007fffa003, pud=180000007fffa003, pmd=180000007fefe003, pte=0068000020ab0706
      Internal error: Oops: 96000007 [#1] SMP
      Modules linked in: ahci_mvebu libahci_platform libahci dvb_usb_af9035 dvb_usb_dib0700 dib0070 dib7000m dibx000_common ath11k_pci ath10k_pci ath10k_core mwl8k_new nf_nat_sip nf_conntrack_sip xhci_plat_hcd xhci_hcd nf_nat_pptp nf_conntrack_pptp at24 sbsa_gwdt
      CPU: 1 PID: 16 Comm: ksoftirqd/1 Not tainted 5.15.42-00066-g3242268d425c-dirty #550
      Hardware name: A8k (DT)
      pstate: 60000005 (nZCv daif -PAN -UAO -TCO -DIT -SSBS BTYPE=--)
      pc : batadv_is_my_mac+0x60/0xc0
      lr : batadv_v_ogm_packet_recv+0x98/0x5d0
      sp : ffffff8000183820
      x29: ffffff8000183820 x28: 0000000000000001 x27: ffffff8014f9af00
      x26: 0000000000000000 x25: 0000000000000543 x24: 0000000000000003
      x23: ffffff8020ab0580 x22: 0000000000000110 x21: ffffff80168ae880
      x20: 0000000000000000 x19: ffffff800b561000 x18: 0000000000000000
      x17: 0000000000000000 x16: 0000000000000000 x15: 00dc098924ae0032
      x14: 0f0405433e0054b0 x13: ffffffff00000080 x12: 0000004000000001
      x11: 0000000000000000 x10: 0000000000000000 x9 : 0000000000000000
      x8 : 0000000000000000 x7 : ffffffc076dae000 x6 : ffffff8000183700
      x5 : ffffffc00955e698 x4 : ffffff80168ae000 x3 : ffffff80059cf000
      x2 : ffffff800b561000 x1 : ffffff8020ab0696 x0 : ffffff80168ae880
      Call trace:
       batadv_is_my_mac+0x60/0xc0
       batadv_v_ogm_packet_recv+0x98/0x5d0
       batadv_batman_skb_recv+0x1b8/0x244
       __netif_receive_skb_core.isra.0+0x440/0xc74
       __netif_receive_skb_one_core+0x14/0x20
       netif_receive_skb+0x68/0x140
       br_pass_frame_up+0x70/0x80
       br_handle_frame_finish+0x108/0x284
       br_handle_frame+0x190/0x250
       __netif_receive_skb_core.isra.0+0x240/0xc74
       __netif_receive_skb_list_core+0x6c/0x90
       netif_receive_skb_list_internal+0x1f4/0x310
       napi_complete_done+0x64/0x1d0
       gro_cell_poll+0x7c/0xa0
       __napi_poll+0x34/0x174
       net_rx_action+0xf8/0x2a0
       _stext+0x12c/0x2ac
       run_ksoftirqd+0x4c/0x7c
       smpboot_thread_fn+0x120/0x210
       kthread+0x140/0x150
       ret_from_fork+0x10/0x20
      Code: f9403844 eb03009f 54fffee1 f94
    
    Thus ethernet header address should only be fetched after
    batadv_check_management_packet has been called.
    
    Fixes: 0da0035942d4 ("batman-adv: OGMv2 - add basic infrastructure")
    Cc: stable@vger.kernel.org
    Signed-off-by: Remi Pommarel <repk@triplefau.lt>
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Simon Wunderlich <sw@simonwunderlich.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 1f82cd26c65061354c7940d9e9cf6e118a862126
Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed Jul 19 10:01:15 2023 +0200

    batman-adv: Don't increase MTU when set by user
    
    commit d8e42a2b0addf238be8b3b37dcd9795a5c1be459 upstream.
    
    If the user set an MTU value, it usually means that there are special
    requirements for the MTU. But if an interface gots activated, the MTU was
    always recalculated and then the user set value was overwritten.
    
    The only reason why this user set value has to be overwritten, is when the
    MTU has to be decreased because batman-adv is not able to transfer packets
    with the user specified size.
    
    Fixes: c6c8fea29769 ("net: Add batman-adv meshing protocol")
    Cc: stable@vger.kernel.org
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Simon Wunderlich <sw@simonwunderlich.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2c783344218d2d22548438fde17b68f21cd48640
Author: Sven Eckelmann <sven@narfation.org>
Date:   Wed Jul 19 09:29:29 2023 +0200

    batman-adv: Trigger events for auto adjusted MTU
    
    commit c6a953cce8d0438391e6da48c8d0793d3fbfcfa6 upstream.
    
    If an interface changes the MTU, it is expected that an NETDEV_PRECHANGEMTU
    and NETDEV_CHANGEMTU notification events is triggered. This worked fine for
    .ndo_change_mtu based changes because core networking code took care of it.
    But for auto-adjustments after hard-interfaces changes, these events were
    simply missing.
    
    Due to this problem, non-batman-adv components weren't aware of MTU changes
    and thus couldn't perform their own tasks correctly.
    
    Fixes: c6c8fea29769 ("net: Add batman-adv meshing protocol")
    Cc: stable@vger.kernel.org
    Signed-off-by: Sven Eckelmann <sven@narfation.org>
    Signed-off-by: Simon Wunderlich <sw@simonwunderlich.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 21cd99431aae3868748c7d9791a14e29103091f8
Author: Christian Göttsche <cgzones@googlemail.com>
Date:   Fri Aug 18 17:33:58 2023 +0200

    selinux: set next pointer before attaching to list
    
    commit 70d91dc9b2ac91327d0eefd86163abc3548effa6 upstream.
    
    Set the next pointer in filename_trans_read_helper() before attaching
    the new node under construction to the list, otherwise garbage would be
    dereferenced on subsequent failure during cleanup in the out goto label.
    
    Cc: <stable@vger.kernel.org>
    Fixes: 430059024389 ("selinux: implement new format of filename transitions")
    Signed-off-by: Christian Göttsche <cgzones@googlemail.com>
    Signed-off-by: Paul Moore <paul@paul-moore.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 14fa028a2e63869855e4d3796622f21d790453cb
Author: Benjamin Coddington <bcodding@redhat.com>
Date:   Fri Aug 4 10:52:20 2023 -0400

    nfsd: Fix race to FREE_STATEID and cl_revoked
    
    commit 3b816601e279756e781e6c4d9b3f3bd21a72ac67 upstream.
    
    We have some reports of linux NFS clients that cannot satisfy a linux knfsd
    server that always sets SEQ4_STATUS_RECALLABLE_STATE_REVOKED even though
    those clients repeatedly walk all their known state using TEST_STATEID and
    receive NFS4_OK for all.
    
    Its possible for revoke_delegation() to set NFS4_REVOKED_DELEG_STID, then
    nfsd4_free_stateid() finds the delegation and returns NFS4_OK to
    FREE_STATEID.  Afterward, revoke_delegation() moves the same delegation to
    cl_revoked.  This would produce the observed client/server effect.
    
    Fix this by ensuring that the setting of sc_type to NFS4_REVOKED_DELEG_STID
    and move to cl_revoked happens within the same cl_lock.  This will allow
    nfsd4_free_stateid() to properly remove the delegation from cl_revoked.
    
    Link: https://bugzilla.redhat.com/show_bug.cgi?id=2217103
    Link: https://bugzilla.redhat.com/show_bug.cgi?id=2176575
    Signed-off-by: Benjamin Coddington <bcodding@redhat.com>
    Cc: stable@vger.kernel.org # v4.17+
    Reviewed-by: Jeff Layton <jlayton@kernel.org>
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit a3a91119964d5eaa7377c858e522e2b4ec69c947
Author: Trond Myklebust <trond.myklebust@hammerspace.com>
Date:   Tue Aug 8 21:17:11 2023 -0400

    NFS: Fix a use after free in nfs_direct_join_group()
    
    commit be2fd1560eb57b7298aa3c258ddcca0d53ecdea3 upstream.
    
    Be more careful when tearing down the subrequests of an O_DIRECT write
    as part of a retransmission.
    
    Reported-by: Chris Mason <clm@fb.com>
    Fixes: ed5d588fe47f ("NFS: Try to join page groups before an O_DIRECT retransmission")
    Cc: stable@vger.kernel.org
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c8df36eedb657f2e0537e56a4c238e5338e7081c
Author: Ryusuke Konishi <konishi.ryusuke@gmail.com>
Date:   Sat Aug 5 22:20:38 2023 +0900

    nilfs2: fix general protection fault in nilfs_lookup_dirty_data_buffers()
    
    commit f83913f8c5b882a312e72b7669762f8a5c9385e4 upstream.
    
    A syzbot stress test reported that create_empty_buffers() called from
    nilfs_lookup_dirty_data_buffers() can cause a general protection fault.
    
    Analysis using its reproducer revealed that the back reference "mapping"
    from a page/folio has been changed to NULL after dirty page/folio gang
    lookup in nilfs_lookup_dirty_data_buffers().
    
    Fix this issue by excluding pages/folios from being collected if, after
    acquiring a lock on each page/folio, its back reference "mapping" differs
    from the pointer to the address space struct that held the page/folio.
    
    Link: https://lkml.kernel.org/r/20230805132038.6435-1-konishi.ryusuke@gmail.com
    Signed-off-by: Ryusuke Konishi <konishi.ryusuke@gmail.com>
    Reported-by: syzbot+0ad741797f4565e7e2d2@syzkaller.appspotmail.com
    Closes: https://lkml.kernel.org/r/0000000000002930a705fc32b231@google.com
    Tested-by: Ryusuke Konishi <konishi.ryusuke@gmail.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit bca3e63be00e34d49139bd3135854c94908422fa
Author: T.J. Mercier <tjmercier@google.com>
Date:   Mon Aug 14 15:16:36 2023 +0000

    mm: multi-gen LRU: don't spin during memcg release
    
    commit 6867c7a3320669cbe44b905a3eb35db725c6d470 upstream.
    
    When a memcg is in the process of being released mem_cgroup_tryget will
    fail because its reference count has already reached 0.  This can happen
    during reclaim if the memcg has already been offlined, and we reclaim all
    remaining pages attributed to the offlined memcg.  shrink_many attempts to
    skip the empty memcg in this case, and continue reclaiming from the
    remaining memcgs in the old generation.  If there is only one memcg
    remaining, or if all remaining memcgs are in the process of being released
    then shrink_many will spin until all memcgs have finished being released.
    The release occurs through a workqueue, so it can take a while before
    kswapd is able to make any further progress.
    
    This fix results in reductions in kswapd activity and direct reclaim in
    a test where 28 apps (working set size > total memory) are repeatedly
    launched in a random sequence:
    
                                           A          B      delta   ratio(%)
               allocstall_movable       5962       3539      -2423     -40.64
                allocstall_normal       2661       2417       -244      -9.17
    kswapd_high_wmark_hit_quickly      53152       7594     -45558     -85.71
                       pageoutrun      57365      11750     -45615     -79.52
    
    Link: https://lkml.kernel.org/r/20230814151636.1639123-1-tjmercier@google.com
    Fixes: e4dde56cd208 ("mm: multi-gen LRU: per-node lru_gen_folio lists")
    Signed-off-by: T.J. Mercier <tjmercier@google.com>
    Acked-by: Yu Zhao <yuzhao@google.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 56d11051190dea6eb4b5db8584e1fd4cac88852c
Author: Miaohe Lin <linmiaohe@huawei.com>
Date:   Tue Jun 27 19:28:08 2023 +0800

    mm: memory-failure: fix unexpected return value in soft_offline_page()
    
    commit e2c1ab070fdc81010ec44634838d24fce9ff9e53 upstream.
    
    When page_handle_poison() fails to handle the hugepage or free page in
    retry path, soft_offline_page() will return 0 while -EBUSY is expected in
    this case.
    
    Consequently the user will think soft_offline_page succeeds while it in
    fact failed.  So the user will not try again later in this case.
    
    Link: https://lkml.kernel.org/r/20230627112808.1275241-1-linmiaohe@huawei.com
    Fixes: b94e02822deb ("mm,hwpoison: try to narrow window race for free pages")
    Signed-off-by: Miaohe Lin <linmiaohe@huawei.com>
    Acked-by: Naoya Horiguchi <naoya.horiguchi@nec.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 22178c6e6c2dc270ef2ff7169620f372fe691edf
Author: Alexandre Ghiti <alexghiti@rivosinc.com>
Date:   Wed Aug 9 18:46:33 2023 +0200

    mm: add a call to flush_cache_vmap() in vmap_pfn()
    
    commit a50420c79731fc5cf27ad43719c1091e842a2606 upstream.
    
    flush_cache_vmap() must be called after new vmalloc mappings are installed
    in the page table in order to allow architectures to make sure the new
    mapping is visible.
    
    It could lead to a panic since on some architectures (like powerpc),
    the page table walker could see the wrong pte value and trigger a
    spurious page fault that can not be resolved (see commit f1cb8f9beba8
    ("powerpc/64s/radix: avoid ptesync after set_pte and
    ptep_set_access_flags")).
    
    But actually the patch is aiming at riscv: the riscv specification
    allows the caching of invalid entries in the TLB, and since we recently
    removed the vmalloc page fault handling, we now need to emit a tlb
    shootdown whenever a new vmalloc mapping is emitted
    (https://lore.kernel.org/linux-riscv/20230725132246.817726-1-alexghiti@rivosinc.com/).
    That's a temporary solution, there are ways to avoid that :)
    
    Link: https://lkml.kernel.org/r/20230809164633.1556126-1-alexghiti@rivosinc.com
    Fixes: 3e9a9e256b1e ("mm: add a vmap_pfn function")
    Reported-by: Dylan Jhong <dylan@andestech.com>
    Closes: https://lore.kernel.org/linux-riscv/ZMytNY2J8iyjbPPy@atctrx.andestech.com/
    Signed-off-by: Alexandre Ghiti <alexghiti@rivosinc.com>
    Reviewed-by: Christoph Hellwig <hch@lst.de>
    Reviewed-by: Palmer Dabbelt <palmer@rivosinc.com>
    Acked-by: Palmer Dabbelt <palmer@rivosinc.com>
    Reviewed-by: Dylan Jhong <dylan@andestech.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 0677bed47996e490ec2991999b8b31f73b30a6fe
Author: Dietmar Eggemann <dietmar.eggemann@arm.com>
Date:   Mon Aug 21 23:19:56 2023 +0100

    cgroup/cpuset: Free DL BW in case can_attach() fails
    
    commit 2ef269ef1ac006acf974793d975539244d77b28f upstream.
    
    cpuset_can_attach() can fail. Postpone DL BW allocation until all tasks
    have been checked. DL BW is not allocated per-task but as a sum over
    all DL tasks migrating.
    
    If multiple controllers are attached to the cgroup next to the cpuset
    controller a non-cpuset can_attach() can fail. In this case free DL BW
    in cpuset_cancel_attach().
    
    Finally, update cpuset DL task count (nr_deadline_tasks) only in
    cpuset_attach().
    
    Suggested-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Dietmar Eggemann <dietmar.eggemann@arm.com>
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Reviewed-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit d1cfa53e5e4e4c8f5ed31c1a8a979b5f4a161a17
Author: Dietmar Eggemann <dietmar.eggemann@arm.com>
Date:   Mon Aug 21 23:19:55 2023 +0100

    sched/deadline: Create DL BW alloc, free & check overflow interface
    
    commit 85989106feb734437e2d598b639991b9185a43a6 upstream.
    
    While moving a set of tasks between exclusive cpusets,
    cpuset_can_attach() -> task_can_attach() calls dl_cpu_busy(..., p) for
    DL BW overflow checking and per-task DL BW allocation on the destination
    root_domain for the DL tasks in this set.
    
    This approach has the issue of not freeing already allocated DL BW in
    the following error cases:
    
    (1) The set of tasks includes multiple DL tasks and DL BW overflow
        checking fails for one of the subsequent DL tasks.
    
    (2) Another controller next to the cpuset controller which is attached
        to the same cgroup fails in its can_attach().
    
    To address this problem rework dl_cpu_busy():
    
    (1) Split it into dl_bw_check_overflow() & dl_bw_alloc() and add a
        dedicated dl_bw_free().
    
    (2) dl_bw_alloc() & dl_bw_free() take a `u64 dl_bw` parameter instead of
        a `struct task_struct *p` used in dl_cpu_busy(). This allows to
        allocate DL BW for a set of tasks too rather than only for a single
        task.
    
    Signed-off-by: Dietmar Eggemann <dietmar.eggemann@arm.com>
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c95a751498c9f3cc7c6f94b14da196bd38610d9c
Author: Juri Lelli <juri.lelli@redhat.com>
Date:   Mon Aug 21 23:19:54 2023 +0100

    cgroup/cpuset: Iterate only if DEADLINE tasks are present
    
    commit c0f78fd5edcf29b2822ac165f9248a6c165e8554 upstream.
    
    update_tasks_root_domain currently iterates over all tasks even if no
    DEADLINE task is present on the cpuset/root domain for which bandwidth
    accounting is being rebuilt. This has been reported to introduce 10+ ms
    delays on suspend-resume operations.
    
    Skip the costly iteration for cpusets that don't contain DEADLINE tasks.
    
    Reported-by: Qais Yousef (Google) <qyousef@layalina.io>
    Link: https://lore.kernel.org/lkml/20230206221428.2125324-1-qyousef@layalina.io/
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Reviewed-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 74fac5bb0d375d6b8f5e7997925063c509089c9c
Author: Juri Lelli <juri.lelli@redhat.com>
Date:   Mon Aug 21 23:19:53 2023 +0100

    sched/cpuset: Keep track of SCHED_DEADLINE task in cpusets
    
    commit 6c24849f5515e4966d94fa5279bdff4acf2e9489 upstream.
    
    Qais reported that iterating over all tasks when rebuilding root domains
    for finding out which ones are DEADLINE and need their bandwidth
    correctly restored on such root domains can be a costly operation (10+
    ms delays on suspend-resume).
    
    To fix the problem keep track of the number of DEADLINE tasks belonging
    to each cpuset and then use this information (followup patch) to only
    perform the above iteration if DEADLINE tasks are actually present in
    the cpuset for which a corresponding root domain is being rebuilt.
    
    Reported-by: Qais Yousef (Google) <qyousef@layalina.io>
    Link: https://lore.kernel.org/lkml/20230206221428.2125324-1-qyousef@layalina.io/
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Reviewed-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 00f3719c85bf78412c9e6d9178099df5dd76634b
Author: Juri Lelli <juri.lelli@redhat.com>
Date:   Mon Aug 21 23:19:52 2023 +0100

    sched/cpuset: Bring back cpuset_mutex
    
    commit 111cd11bbc54850f24191c52ff217da88a5e639b upstream.
    
    Turns out percpu_cpuset_rwsem - commit 1243dc518c9d ("cgroup/cpuset:
    Convert cpuset_mutex to percpu_rwsem") - wasn't such a brilliant idea,
    as it has been reported to cause slowdowns in workloads that need to
    change cpuset configuration frequently and it is also not implementing
    priority inheritance (which causes troubles with realtime workloads).
    
    Convert percpu_cpuset_rwsem back to regular cpuset_mutex. Also grab it
    only for SCHED_DEADLINE tasks (other policies don't care about stable
    cpusets anyway).
    
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Reviewed-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 4d17b2ea4ee6ce37b226ec39b7c25991f3410dcd
Author: Juri Lelli <juri.lelli@redhat.com>
Date:   Mon Aug 21 23:19:51 2023 +0100

    cgroup/cpuset: Rename functions dealing with DEADLINE accounting
    
    commit ad3a557daf6915296a43ef97a3e9c48e076c9dd8 upstream.
    
    rebuild_root_domains() and update_tasks_root_domain() have neutral
    names, but actually deal with DEADLINE bandwidth accounting.
    
    Rename them to use 'dl_' prefix so that intent is more clear.
    
    No functional change.
    
    Suggested-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Juri Lelli <juri.lelli@redhat.com>
    Reviewed-by: Waiman Long <longman@redhat.com>
    Signed-off-by: Tejun Heo <tj@kernel.org>
    Signed-off-by: Qais Yousef (Google) <qyousef@layalina.io>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit aefabccb1334ad58522af4bcad5f08fea8d442ea
Author: Jani Nikula <jani.nikula@intel.com>
Date:   Fri Aug 4 11:45:59 2023 +0300

    drm/i915: fix display probe for IVB Q and IVB D GT2 server
    
    commit 423ffe62c06ae241ad460f4629dddb9dcf55e060 upstream.
    
    The current display probe is unable to differentiate between IVB Q and
    IVB D GT2 server, as they both have the same device id, but different
    subvendor and subdevice. This leads to the latter being misidentified as
    the former, and should just end up not having a display. However, the no
    display case returns a NULL as the display device info, and promptly
    oopses.
    
    As the IVB Q case is rare, and we're anyway moving towards GMD ID,
    handle the identification requiring subvendor and subdevice as a special
    case first, instead of unnecessarily growing the intel_display_ids[]
    array with subvendor and subdevice.
    
    [    5.425298] BUG: kernel NULL pointer dereference, address: 0000000000000000
    [    5.426059] #PF: supervisor read access in kernel mode
    [    5.426810] #PF: error_code(0x0000) - not-present page
    [    5.427570] PGD 0 P4D 0
    [    5.428285] Oops: 0000 [#1] PREEMPT SMP PTI
    [    5.429035] CPU: 0 PID: 137 Comm: (udev-worker) Not tainted 6.4.0-1-amd64 #1  Debian 6.4.4-1
    [    5.429759] Hardware name: HP HP Z220 SFF Workstation/HP Z220 SFF Workstation, BIOS 4.19-218-gb184e6e0a1 02/02/2023
    [    5.430485] RIP: 0010:intel_device_info_driver_create+0xf1/0x120 [i915]
    [    5.431338] Code: 48 8b 97 80 1b 00 00 89 8f c0 1b 00 00 48 89 b7 b0 1b 00 00 48 89 97 b8 1b 00 00 0f b7 fd e8 76 e8 14 00 48 89 83 50 1b 00 00 <48> 8b 08 48 89 8b c4 1b 00 00 48 8b 48 08 48 89 8b cc 1b 00 00 8b
    [    5.432920] RSP: 0018:ffffb8254044fb98 EFLAGS: 00010206
    [    5.433707] RAX: 0000000000000000 RBX: ffff923076e80000 RCX: 0000000000000000
    [    5.434494] RDX: 0000000000000260 RSI: 0000000100001000 RDI: 000000000000016a
    [    5.435277] RBP: 000000000000016a R08: ffffb8254044fb00 R09: 0000000000000000
    [    5.436055] R10: ffff922d02761de8 R11: 00657361656c6572 R12: ffffffffc0e5d140
    [    5.436867] R13: ffff922d00b720d0 R14: 0000000076e80000 R15: ffff923078c0cae8
    [    5.437646] FS:  00007febd19a18c0(0000) GS:ffff92307c000000(0000) knlGS:0000000000000000
    [    5.438434] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [    5.439218] CR2: 0000000000000000 CR3: 000000010256e002 CR4: 00000000001706f0
    [    5.440009] Call Trace:
    [    5.440824]  <TASK>
    [    5.441611]  ? __die+0x23/0x70
    [    5.442394]  ? page_fault_oops+0x17d/0x4c0
    [    5.443173]  ? exc_page_fault+0x7f/0x180
    [    5.443949]  ? asm_exc_page_fault+0x26/0x30
    [    5.444756]  ? intel_device_info_driver_create+0xf1/0x120 [i915]
    [    5.445652]  ? intel_device_info_driver_create+0xea/0x120 [i915]
    [    5.446545]  i915_driver_probe+0x7f/0xb60 [i915]
    [    5.447431]  ? drm_privacy_screen_get+0x15c/0x1a0 [drm]
    [    5.448240]  local_pci_probe+0x45/0xa0
    [    5.449013]  pci_device_probe+0xc7/0x240
    [    5.449748]  really_probe+0x19e/0x3e0
    [    5.450464]  ? __pfx___driver_attach+0x10/0x10
    [    5.451172]  __driver_probe_device+0x78/0x160
    [    5.451870]  driver_probe_device+0x1f/0x90
    [    5.452601]  __driver_attach+0xd2/0x1c0
    [    5.453293]  bus_for_each_dev+0x88/0xd0
    [    5.453989]  bus_add_driver+0x116/0x220
    [    5.454672]  driver_register+0x59/0x100
    [    5.455336]  i915_init+0x25/0xc0 [i915]
    [    5.456104]  ? __pfx_i915_init+0x10/0x10 [i915]
    [    5.456882]  do_one_initcall+0x5d/0x240
    [    5.457511]  do_init_module+0x60/0x250
    [    5.458126]  __do_sys_finit_module+0xac/0x120
    [    5.458721]  do_syscall_64+0x60/0xc0
    [    5.459314]  ? syscall_exit_to_user_mode+0x1b/0x40
    [    5.459897]  ? do_syscall_64+0x6c/0xc0
    [    5.460510]  entry_SYSCALL_64_after_hwframe+0x72/0xdc
    [    5.461082] RIP: 0033:0x7febd20b0eb9
    [    5.461648] Code: 08 89 e8 5b 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2f 1f 0d 00 f7 d8 64 89 01 48
    [    5.462905] RSP: 002b:00007fffabb1ba78 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
    [    5.463554] RAX: ffffffffffffffda RBX: 0000561e6304f410 RCX: 00007febd20b0eb9
    [    5.464201] RDX: 0000000000000000 RSI: 00007febd2244f0d RDI: 0000000000000015
    [    5.464869] RBP: 00007febd2244f0d R08: 0000000000000000 R09: 000000000000000a
    [    5.465512] R10: 0000000000000015 R11: 0000000000000246 R12: 0000000000020000
    [    5.466124] R13: 0000000000000000 R14: 0000561e63032b60 R15: 000000000000000a
    [    5.466700]  </TASK>
    [    5.467271] Modules linked in: i915(+) drm_buddy video crc32_pclmul sr_mod hid_generic wmi crc32c_intel i2c_algo_bit sd_mod cdrom drm_display_helper cec usbhid rc_core ghash_clmulni_intel hid sha512_ssse3 ttm sha512_generic xhci_pci ehci_pci xhci_hcd ehci_hcd nvme ahci drm_kms_helper nvme_core libahci t10_pi libata psmouse aesni_intel scsi_mod crypto_simd i2c_i801 scsi_common crc64_rocksoft_generic cryptd i2c_smbus drm lpc_ich crc64_rocksoft crc_t10dif e1000e usbcore crct10dif_generic usb_common crct10dif_pclmul crc64 crct10dif_common button
    [    5.469750] CR2: 0000000000000000
    [    5.470364] ---[ end trace 0000000000000000 ]---
    [    5.470971] RIP: 0010:intel_device_info_driver_create+0xf1/0x120 [i915]
    [    5.471699] Code: 48 8b 97 80 1b 00 00 89 8f c0 1b 00 00 48 89 b7 b0 1b 00 00 48 89 97 b8 1b 00 00 0f b7 fd e8 76 e8 14 00 48 89 83 50 1b 00 00 <48> 8b 08 48 89 8b c4 1b 00 00 48 8b 48 08 48 89 8b cc 1b 00 00 8b
    [    5.473034] RSP: 0018:ffffb8254044fb98 EFLAGS: 00010206
    [    5.473698] RAX: 0000000000000000 RBX: ffff923076e80000 RCX: 0000000000000000
    [    5.474371] RDX: 0000000000000260 RSI: 0000000100001000 RDI: 000000000000016a
    [    5.475045] RBP: 000000000000016a R08: ffffb8254044fb00 R09: 0000000000000000
    [    5.475725] R10: ffff922d02761de8 R11: 00657361656c6572 R12: ffffffffc0e5d140
    [    5.476405] R13: ffff922d00b720d0 R14: 0000000076e80000 R15: ffff923078c0cae8
    [    5.477124] FS:  00007febd19a18c0(0000) GS:ffff92307c000000(0000) knlGS:0000000000000000
    [    5.477811] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [    5.478499] CR2: 0000000000000000 CR3: 000000010256e002 CR4: 00000000001706f0
    
    Fixes: 69d439818fe5 ("drm/i915/display: Make display responsible for probing its own IP")
    Closes: https://gitlab.freedesktop.org/drm/intel/-/issues/8991
    Cc: Matt Roper <matthew.d.roper@intel.com>
    Cc: Andrzej Hajda <andrzej.hajda@intel.com>
    Reviewed-by: Luca Coelho <luciano.coelho@intel.com>
    Reviewed-by: Matt Roper <matthew.d.roper@intel.com>
    Signed-off-by: Jani Nikula <jani.nikula@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230804084600.1005818-1-jani.nikula@intel.com
    (cherry picked from commit 1435188307d128671f677eb908e165666dd83652)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 6621912f42212e1229c1d871837bd955857de9f3
Author: Matt Roper <matthew.d.roper@intel.com>
Date:   Tue May 23 12:56:08 2023 -0700

    drm/i915/display: Handle GMD_ID identification in display code
    
    commit 12e6f6dc78e4f4a418648fb1a9c0cd2ae9b3430b upstream.
    
    For platforms with GMD_ID support (i.e., everything MTL and beyond),
    identification of the display IP present should be based on the contents
    of the GMD_ID register rather than a PCI devid match.
    
    Note that since GMD_ID readout requires access to the PCI BAR, a slight
    change to the driver init sequence is needed --- pci_enable_device() is
    now called before i915_driver_create().
    
    v2:
     - Fix use of uninitialized i915 pointer in error path if
       pci_enable_device() fails before the i915 device is created.  (lkp)
     - Use drm_device parameter to intel_display_device_probe.  This goes
       against i915 conventions, but since the primary goal here is to make
       it easy to call this function from other drivers (like Xe) and since
       we don't need anything from the i915 structure, this seems like an
       exception where drm_device is a more natural fit.
    v3:
     - Go back do drm_i915_private for intel_display_device_probe.  (Jani)
     - Move forward decl to top of header.  (Jani)
    
    Signed-off-by: Matt Roper <matthew.d.roper@intel.com>
    Reviewed-by: Andrzej Hajda <andrzej.hajda@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230523195609.73627-6-matthew.d.roper@intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 517771333fd41cb79fc6709dd32f6ad9838b5d95
Author: Feng Tang <feng.tang@intel.com>
Date:   Wed Aug 23 14:57:47 2023 +0800

    x86/fpu: Set X86_FEATURE_OSXSAVE feature after enabling OSXSAVE in CR4
    
    commit 2c66ca3949dc701da7f4c9407f2140ae425683a5 upstream.
    
    0-Day found a 34.6% regression in stress-ng's 'af-alg' test case, and
    bisected it to commit b81fac906a8f ("x86/fpu: Move FPU initialization into
    arch_cpu_finalize_init()"), which optimizes the FPU init order, and moves
    the CR4_OSXSAVE enabling into a later place:
    
       arch_cpu_finalize_init
           identify_boot_cpu
               identify_cpu
                   generic_identify
                       get_cpu_cap --> setup cpu capability
           ...
           fpu__init_cpu
               fpu__init_cpu_xstate
                   cr4_set_bits(X86_CR4_OSXSAVE);
    
    As the FPU is not yet initialized the CPU capability setup fails to set
    X86_FEATURE_OSXSAVE. Many security module like 'camellia_aesni_avx_x86_64'
    depend on this feature and therefore fail to load, causing the regression.
    
    Cure this by setting X86_FEATURE_OSXSAVE feature right after OSXSAVE
    enabling.
    
    [ tglx: Moved it into the actual BSP FPU initialization code and added a comment ]
    
    Fixes: b81fac906a8f ("x86/fpu: Move FPU initialization into arch_cpu_finalize_init()")
    Reported-by: kernel test robot <oliver.sang@intel.com>
    Signed-off-by: Feng Tang <feng.tang@intel.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Cc: stable@vger.kernel.org
    Link: https://lore.kernel.org/lkml/202307192135.203ac24e-oliver.sang@intel.com
    Link: https://lore.kernel.org/lkml/20230823065747.92257-1-feng.tang@intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 4b04c422ea8da971ef579d0ff6357de6226d0d71
Author: Rick Edgecombe <rick.p.edgecombe@intel.com>
Date:   Fri Aug 18 10:03:05 2023 -0700

    x86/fpu: Invalidate FPU state correctly on exec()
    
    commit 1f69383b203e28cf8a4ca9570e572da1699f76cd upstream.
    
    The thread flag TIF_NEED_FPU_LOAD indicates that the FPU saved state is
    valid and should be reloaded when returning to userspace. However, the
    kernel will skip doing this if the FPU registers are already valid as
    determined by fpregs_state_valid(). The logic embedded there considers
    the state valid if two cases are both true:
    
      1: fpu_fpregs_owner_ctx points to the current tasks FPU state
      2: the last CPU the registers were live in was the current CPU.
    
    This is usually correct logic. A CPU’s fpu_fpregs_owner_ctx is set to
    the current FPU during the fpregs_restore_userregs() operation, so it
    indicates that the registers have been restored on this CPU. But this
    alone doesn’t preclude that the task hasn’t been rescheduled to a
    different CPU, where the registers were modified, and then back to the
    current CPU. To verify that this was not the case the logic relies on the
    second condition. So the assumption is that if the registers have been
    restored, AND they haven’t had the chance to be modified (by being
    loaded on another CPU), then they MUST be valid on the current CPU.
    
    Besides the lazy FPU optimizations, the other cases where the FPU
    registers might not be valid are when the kernel modifies the FPU register
    state or the FPU saved buffer. In this case the operation modifying the
    FPU state needs to let the kernel know the correspondence has been
    broken. The comment in “arch/x86/kernel/fpu/context.h” has:
    /*
    ...
     * If the FPU register state is valid, the kernel can skip restoring the
     * FPU state from memory.
     *
     * Any code that clobbers the FPU registers or updates the in-memory
     * FPU state for a task MUST let the rest of the kernel know that the
     * FPU registers are no longer valid for this task.
     *
     * Either one of these invalidation functions is enough. Invalidate
     * a resource you control: CPU if using the CPU for something else
     * (with preemption disabled), FPU for the current task, or a task that
     * is prevented from running by the current task.
     */
    
    However, this is not completely true. When the kernel modifies the
    registers or saved FPU state, it can only rely on
    __fpu_invalidate_fpregs_state(), which wipes the FPU’s last_cpu
    tracking. The exec path instead relies on fpregs_deactivate(), which sets
    the CPU’s FPU context to NULL. This was observed to fail to restore the
    reset FPU state to the registers when returning to userspace in the
    following scenario:
    
    1. A task is executing in userspace on CPU0
            - CPU0’s FPU context points to tasks
            - fpu->last_cpu=CPU0
    
    2. The task exec()’s
    
    3. While in the kernel the task is preempted
            - CPU0 gets a thread executing in the kernel (such that no other
                    FPU context is activated)
            - Scheduler sets task’s fpu->last_cpu=CPU0 when scheduling out
    
    4. Task is migrated to CPU1
    
    5. Continuing the exec(), the task gets to
       fpu_flush_thread()->fpu_reset_fpregs()
            - Sets CPU1’s fpu context to NULL
            - Copies the init state to the task’s FPU buffer
            - Sets TIF_NEED_FPU_LOAD on the task
    
    6. The task reschedules back to CPU0 before completing the exec() and
       returning to userspace
            - During the reschedule, scheduler finds TIF_NEED_FPU_LOAD is set
            - Skips saving the registers and updating task’s fpu→last_cpu,
              because TIF_NEED_FPU_LOAD is the canonical source.
    
    7. Now CPU0’s FPU context is still pointing to the task’s, and
       fpu->last_cpu is still CPU0. So fpregs_state_valid() returns true even
       though the reset FPU state has not been restored.
    
    So the root cause is that exec() is doing the wrong kind of invalidate. It
    should reset fpu->last_cpu via __fpu_invalidate_fpregs_state(). Further,
    fpu__drop() doesn't really seem appropriate as the task (and FPU) are not
    going away, they are just getting reset as part of an exec. So switch to
    __fpu_invalidate_fpregs_state().
    
    Also, delete the misleading comment that says that either kind of
    invalidate will be enough, because it’s not always the case.
    
    Fixes: 33344368cb08 ("x86/fpu: Clean up the fpu__clear() variants")
    Reported-by: Lei Wang <lei4.wang@intel.com>
    Signed-off-by: Rick Edgecombe <rick.p.edgecombe@intel.com>
    Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
    Tested-by: Lijun Pan <lijun.pan@intel.com>
    Reviewed-by: Sohil Mehta <sohil.mehta@intel.com>
    Acked-by: Lijun Pan <lijun.pan@intel.com>
    Cc: stable@vger.kernel.org
    Link: https://lore.kernel.org/r/20230818170305.502891-1-rick.p.edgecombe@intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 0c2a9b7ba1b8401135b5322e4f8cb4ac17895471
Author: Huacai Chen <chenhuacai@kernel.org>
Date:   Sat Aug 26 22:21:57 2023 +0800

    LoongArch: Fix hw_breakpoint_control() for watchpoints
    
    commit 9730870b484e9de852b51df08a8b357b1129489e upstream.
    
    In hw_breakpoint_control(), encode_ctrl_reg() has already encoded the
    MWPnCFG3_LoadEn/MWPnCFG3_StoreEn bits in info->ctrl. We don't need to
    add (1 << MWPnCFG3_LoadEn | 1 << MWPnCFG3_StoreEn) unconditionally.
    
    Otherwise we can't set read watchpoint and write watchpoint separately.
    
    Cc: stable@vger.kernel.org
    Signed-off-by: Huacai Chen <chenhuacai@loongson.cn>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 8771f80bafa350f58f25d72d3fb2c621c93642a4
Author: Imre Deak <imre.deak@intel.com>
Date:   Tue Aug 22 14:30:15 2023 +0300

    drm/i915: Fix HPD polling, reenabling the output poll work as needed
    
    commit 1dcc437427bbcebc8381226352f7ade08a271191 upstream.
    
    After the commit in the Fixes: line below, HPD polling stopped working
    on i915, since after that change calling drm_kms_helper_poll_enable()
    doesn't restart drm_mode_config::output_poll_work if the work was
    stopped (no connectors needing polling) and enabling polling for a
    connector (during runtime suspend or detecting an HPD IRQ storm).
    
    After the above change calling drm_kms_helper_poll_enable() is a nop
    after it's been called already and polling for some connectors was
    disabled/re-enabled.
    
    Fix this by calling drm_kms_helper_poll_reschedule() added in the
    previous patch instead, which reschedules the work whenever expected.
    
    Fixes: d33a54e3991d ("drm/probe_helper: sort out poll_running vs poll_enabled")
    CC: stable@vger.kernel.org # 6.4+
    Cc: Dmitry Baryshkov <dmitry.baryshkov@linaro.org>
    Cc: dri-devel@lists.freedesktop.org
    Reviewed-by: Jouni Högander <jouni.hogander@intel.com>
    Signed-off-by: Imre Deak <imre.deak@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230822113015.41224-2-imre.deak@intel.com
    (cherry picked from commit 50452f2f76852322620b63e62922b85e955abe94)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 40b67b55337aaac6ba413e23e07af6ae1a7eeb58
Author: Ankit Nautiyal <ankit.k.nautiyal@intel.com>
Date:   Fri Aug 18 10:14:36 2023 +0530

    drm/display/dp: Fix the DP DSC Receiver cap size
    
    commit 5ad1ab30ac0809d2963ddcf39ac34317a24a2f17 upstream.
    
    DP DSC Receiver Capabilities are exposed via DPCD 60h-6Fh.
    Fix the DSC RECEIVER CAP SIZE accordingly.
    
    Fixes: ffddc4363c28 ("drm/dp: Add DP DSC DPCD receiver capability size define and missing SHIFT")
    Cc: Anusha Srivatsa <anusha.srivatsa@intel.com>
    Cc: Manasi Navare <manasi.d.navare@intel.com>
    Cc: <stable@vger.kernel.org> # v5.0+
    
    Signed-off-by: Ankit Nautiyal <ankit.k.nautiyal@intel.com>
    Reviewed-by: Stanislav Lisovskiy <stanislav.lisovskiy@intel.com>
    Signed-off-by: Jani Nikula <jani.nikula@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230818044436.177806-1-ankit.k.nautiyal@intel.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 503d787d303e21fd25c166dbee9a0fa917c9c448
Author: Anshuman Gupta <anshuman.gupta@intel.com>
Date:   Wed Aug 16 18:22:16 2023 +0530

    drm/i915/dgfx: Enable d3cold at s2idle
    
    commit 2872144aec04baa7e43ecd2a60f7f0be3aa843fd upstream.
    
    System wide suspend already has support for lmem save/restore during
    suspend therefore enabling d3cold for s2idle and keepng it disable for
    runtime PM.(Refer below commit for d3cold runtime PM disable justification)
    'commit 66eb93e71a7a ("drm/i915/dgfx: Keep PCI autosuspend control
    'on' by default on all dGPU")'
    
    It will reduce the DG2 Card power consumption to ~0 Watt
    for s2idle power KPI.
    
    v2:
    - Added "Cc: stable@vger.kernel.org".
    
    Closes: https://gitlab.freedesktop.org/drm/intel/-/issues/8755
    Cc: stable@vger.kernel.org
    Cc: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Anshuman Gupta <anshuman.gupta@intel.com>
    Reviewed-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Tested-by: Aaron Ma <aaron.ma@canonical.com>
    Tested-by: Jianshui Yu <Jianshui.yu@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230816125216.1722002-1-anshuman.gupta@intel.com
    (cherry picked from commit 2643e6d1f2a5e51877be24042d53cf956589be10)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit dd8683e0af50a59a9215966fb8e6726d753d069a
Author: David Michael <fedora.dm0@gmail.com>
Date:   Tue Aug 15 21:42:41 2023 -0400

    drm/panfrost: Skip speed binning on EOPNOTSUPP
    
    commit f19df6e4de64b7fc6d71f192aa9ff3b701e4bade upstream.
    
    Encountered on an ARM Mali-T760 MP4, attempting to read the nvmem
    variable can also return EOPNOTSUPP instead of ENOENT when speed
    binning is unsupported.
    
    Cc: <stable@vger.kernel.org>
    Fixes: 7d690f936e9b ("drm/panfrost: Add basic support for speed binning")
    Signed-off-by: David Michael <fedora.dm0@gmail.com>
    Reviewed-by: Steven Price <steven.price@arm.com>
    Signed-off-by: Steven Price <steven.price@arm.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/87msyryd7y.fsf@gmail.com
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 933f1fc826a5cb962ece5cc69540eddca3d868e0
Author: Imre Deak <imre.deak@intel.com>
Date:   Tue Aug 22 14:30:14 2023 +0300

    drm: Add an HPD poll helper to reschedule the poll work
    
    commit a94e7ccfc400c024976f3c2f31689ed843498b7c upstream.
    
    Add a helper to reschedule drm_mode_config::output_poll_work after
    polling has been enabled for a connector (and needing a reschedule,
    since previously polling was disabled for all connectors and hence
    output_poll_work was not running).
    
    This is needed by the next patch fixing HPD polling on i915.
    
    CC: stable@vger.kernel.org # 6.4+
    Cc: Dmitry Baryshkov <dmitry.baryshkov@linaro.org>
    Cc: dri-devel@lists.freedesktop.org
    Reviewed-by: Jouni Högander <jouni.hogander@intel.com>
    Reviewed-by: Dmitry Baryshkov <dmitry.baryshkov@linaro.org>
    Signed-off-by: Imre Deak <imre.deak@intel.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230822113015.41224-1-imre.deak@intel.com
    (cherry picked from commit fe2352fd64029918174de4b460dfe6df0c6911cd)
    Signed-off-by: Rodrigo Vivi <rodrigo.vivi@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 6969e4500d86576ee78ecbf3d612e964ca5488ba
Author: Zack Rusin <zackr@vmware.com>
Date:   Fri Aug 18 00:13:01 2023 -0400

    drm/vmwgfx: Fix possible invalid drm gem put calls
    
    commit f9e96bf1905479f18e83a3a4c314a8dfa56ede2c upstream.
    
    vmw_bo_unreference sets the input buffer to null on exit, resulting in
    null ptr deref's on the subsequent drm gem put calls.
    
    This went unnoticed because only very old userspace would be exercising
    those paths but it wouldn't be hard to hit on old distros with brand
    new kernels.
    
    Introduce a new function that abstracts unrefing of user bo's to make
    the code cleaner and more explicit.
    
    Signed-off-by: Zack Rusin <zackr@vmware.com>
    Reported-by: Ian Forbes <iforbes@vmware.com>
    Fixes: 9ef8d83e8e25 ("drm/vmwgfx: Do not drop the reference to the handle too soon")
    Cc: <stable@vger.kernel.org> # v6.4+
    Reviewed-by: Maaz Mombasawala<mombasawalam@vmware.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230818041301.407636-1-zack@kde.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 5574b0cbb493f3b3cbb2c381e8e1dac52a70213f
Author: Zack Rusin <zackr@vmware.com>
Date:   Fri Jun 16 15:09:34 2023 -0400

    drm/vmwgfx: Fix shader stage validation
    
    commit 14abdfae508228a7307f7491b5c4215ae70c6542 upstream.
    
    For multiple commands the driver was not correctly validating the shader
    stages resulting in possible kernel oopses. The validation code was only.
    if ever, checking the upper bound on the shader stages but never a lower
    bound (valid shader stages start at 1 not 0).
    
    Fixes kernel oopses ending up in vmw_binding_add, e.g.:
    Oops: 0000 [#1] PREEMPT SMP PTI
    CPU: 1 PID: 2443 Comm: testcase Not tainted 6.3.0-rc4-vmwgfx #1
    Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 11/12/2020
    RIP: 0010:vmw_binding_add+0x4c/0x140 [vmwgfx]
    Code: 7e 30 49 83 ff 0e 0f 87 ea 00 00 00 4b 8d 04 7f 89 d2 89 cb 48 c1 e0 03 4c 8b b0 40 3d 93 c0 48 8b 80 48 3d 93 c0 49 0f af de <48> 03 1c d0 4c 01 e3 49 8>
    RSP: 0018:ffffb8014416b968 EFLAGS: 00010206
    RAX: ffffffffc0933ec0 RBX: 0000000000000000 RCX: 0000000000000000
    RDX: 00000000ffffffff RSI: ffffb8014416b9c0 RDI: ffffb8014316f000
    RBP: ffffb8014416b998 R08: 0000000000000003 R09: 746f6c735f726564
    R10: ffffffffaaf2bda0 R11: 732e676e69646e69 R12: ffffb8014316f000
    R13: ffffb8014416b9c0 R14: 0000000000000040 R15: 0000000000000006
    FS:  00007fba8c0af740(0000) GS:ffff8a1277c80000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 00000007c0933eb8 CR3: 0000000118244001 CR4: 00000000003706e0
    Call Trace:
     <TASK>
     vmw_view_bindings_add+0xf5/0x1b0 [vmwgfx]
     ? ___drm_dbg+0x8a/0xb0 [drm]
     vmw_cmd_dx_set_shader_res+0x8f/0xc0 [vmwgfx]
     vmw_execbuf_process+0x590/0x1360 [vmwgfx]
     vmw_execbuf_ioctl+0x173/0x370 [vmwgfx]
     ? __drm_dev_dbg+0xb4/0xe0 [drm]
     ? __pfx_vmw_execbuf_ioctl+0x10/0x10 [vmwgfx]
     drm_ioctl_kernel+0xbc/0x160 [drm]
     drm_ioctl+0x2d2/0x580 [drm]
     ? __pfx_vmw_execbuf_ioctl+0x10/0x10 [vmwgfx]
     ? do_fault+0x1a6/0x420
     vmw_generic_ioctl+0xbd/0x180 [vmwgfx]
     vmw_unlocked_ioctl+0x19/0x20 [vmwgfx]
     __x64_sys_ioctl+0x96/0xd0
     do_syscall_64+0x5d/0x90
     ? handle_mm_fault+0xe4/0x2f0
     ? debug_smp_processor_id+0x1b/0x30
     ? fpregs_assert_state_consistent+0x2e/0x50
     ? exit_to_user_mode_prepare+0x40/0x180
     ? irqentry_exit_to_user_mode+0xd/0x20
     ? irqentry_exit+0x3f/0x50
     ? exc_page_fault+0x8b/0x180
     entry_SYSCALL_64_after_hwframe+0x72/0xdc
    
    Signed-off-by: Zack Rusin <zackr@vmware.com>
    Cc: security@openanolis.org
    Reported-by: Ziming Zhang <ezrakiez@gmail.com>
    Testcase-found-by: Niels De Graef <ndegraef@redhat.com>
    Fixes: d80efd5cb3de ("drm/vmwgfx: Initial DX support")
    Cc: <stable@vger.kernel.org> # v4.3+
    Reviewed-by: Maaz Mombasawala<mombasawalam@vmware.com>
    Reviewed-by: Martin Krastev <krastevm@vmware.com>
    Link: https://patchwork.freedesktop.org/patch/msgid/20230616190934.54828-1-zack@kde.org
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 980cde3ac4bb4c500c8dbdc881937956842a8a3f
Author: David Hildenbrand <david@redhat.com>
Date:   Sat Aug 5 12:12:56 2023 +0200

    mm/gup: handle cont-PTE hugetlb pages correctly in gup_must_unshare() via GUP-fast
    
    commit 5805192c7b7257d290474cb1a3897d0567281bbc upstream.
    
    In contrast to most other GUP code, GUP-fast common page table walking
    code like gup_pte_range() also handles hugetlb pages.  But in contrast to
    other hugetlb page table walking code, it does not look at the hugetlb PTE
    abstraction whereby we have only a single logical hugetlb PTE per hugetlb
    page, even when using multiple cont-PTEs underneath -- which is for
    example what huge_ptep_get() abstracts.
    
    So when we have a hugetlb page that is mapped via cont-PTEs, GUP-fast
    might stumble over a PTE that does not map the head page of a hugetlb page
    -- not the first "head" PTE of such a cont mapping.
    
    Logically, the whole hugetlb page is mapped (entire_mapcount == 1), but we
    might end up calling gup_must_unshare() with a tail page of a hugetlb
    page.
    
    We only maintain a single PageAnonExclusive flag per hugetlb page (as
    hugetlb pages cannot get partially COW-shared), stored for the head page.
    That flag is clear for all tail pages.
    
    So when gup_must_unshare() ends up calling PageAnonExclusive() with a tail
    page of a hugetlb page:
    
    1) With CONFIG_DEBUG_VM_PGFLAGS
    
    Stumbles over the:
    
            VM_BUG_ON_PGFLAGS(PageHuge(page) && !PageHead(page), page);
    
    For example, when executing the COW selftests with 64k hugetlb pages on
    arm64:
    
      [   61.082187] page:00000000829819ff refcount:3 mapcount:1 mapping:0000000000000000 index:0x1 pfn:0x11ee11
      [   61.082842] head:0000000080f79bf7 order:4 entire_mapcount:1 nr_pages_mapped:0 pincount:2
      [   61.083384] anon flags: 0x17ffff80003000e(referenced|uptodate|dirty|head|mappedtodisk|node=0|zone=2|lastcpupid=0xfffff)
      [   61.084101] page_type: 0xffffffff()
      [   61.084332] raw: 017ffff800000000 fffffc00037b8401 0000000000000402 0000000200000000
      [   61.084840] raw: 0000000000000010 0000000000000000 00000000ffffffff 0000000000000000
      [   61.085359] head: 017ffff80003000e ffffd9e95b09b788 ffffd9e95b09b788 ffff0007ff63cf71
      [   61.085885] head: 0000000000000000 0000000000000002 00000003ffffffff 0000000000000000
      [   61.086415] page dumped because: VM_BUG_ON_PAGE(PageHuge(page) && !PageHead(page))
      [   61.086914] ------------[ cut here ]------------
      [   61.087220] kernel BUG at include/linux/page-flags.h:990!
      [   61.087591] Internal error: Oops - BUG: 00000000f2000800 [#1] SMP
      [   61.087999] Modules linked in: ...
      [   61.089404] CPU: 0 PID: 4612 Comm: cow Kdump: loaded Not tainted 6.5.0-rc4+ #3
      [   61.089917] Hardware name: QEMU KVM Virtual Machine, BIOS 0.0.0 02/06/2015
      [   61.090409] pstate: 604000c5 (nZCv daIF +PAN -UAO -TCO -DIT -SSBS BTYPE=--)
      [   61.090897] pc : gup_must_unshare.part.0+0x64/0x98
      [   61.091242] lr : gup_must_unshare.part.0+0x64/0x98
      [   61.091592] sp : ffff8000825eb940
      [   61.091826] x29: ffff8000825eb940 x28: 0000000000000000 x27: fffffc00037b8440
      [   61.092329] x26: 0400000000000001 x25: 0000000000080101 x24: 0000000000080000
      [   61.092835] x23: 0000000000080100 x22: ffff0000cffb9588 x21: ffff0000c8ec6b58
      [   61.093341] x20: 0000ffffad6b1000 x19: fffffc00037b8440 x18: ffffffffffffffff
      [   61.093850] x17: 2864616548656761 x16: 5021202626202965 x15: 6761702865677548
      [   61.094358] x14: 6567615028454741 x13: 2929656761702864 x12: 6165486567615021
      [   61.094858] x11: 00000000ffff7fff x10: 00000000ffff7fff x9 : ffffd9e958b7a1c0
      [   61.095359] x8 : 00000000000bffe8 x7 : c0000000ffff7fff x6 : 00000000002bffa8
      [   61.095873] x5 : ffff0008bb19e708 x4 : 0000000000000000 x3 : 0000000000000000
      [   61.096380] x2 : 0000000000000000 x1 : ffff0000cf6636c0 x0 : 0000000000000046
      [   61.096894] Call trace:
      [   61.097080]  gup_must_unshare.part.0+0x64/0x98
      [   61.097392]  gup_pte_range+0x3a8/0x3f0
      [   61.097662]  gup_pgd_range+0x1ec/0x280
      [   61.097942]  lockless_pages_from_mm+0x64/0x1a0
      [   61.098258]  internal_get_user_pages_fast+0xe4/0x1d0
      [   61.098612]  pin_user_pages_fast+0x58/0x78
      [   61.098917]  pin_longterm_test_start+0xf4/0x2b8
      [   61.099243]  gup_test_ioctl+0x170/0x3b0
      [   61.099528]  __arm64_sys_ioctl+0xa8/0xf0
      [   61.099822]  invoke_syscall.constprop.0+0x7c/0xd0
      [   61.100160]  el0_svc_common.constprop.0+0xe8/0x100
      [   61.100500]  do_el0_svc+0x38/0xa0
      [   61.100736]  el0_svc+0x3c/0x198
      [   61.100971]  el0t_64_sync_handler+0x134/0x150
      [   61.101280]  el0t_64_sync+0x17c/0x180
      [   61.101543] Code: aa1303e0 f00074c1 912b0021 97fffeb2 (d4210000)
    
    2) Without CONFIG_DEBUG_VM_PGFLAGS
    
    Always detects "not exclusive" for passed tail pages and refuses to PIN
    the tail pages R/O, as gup_must_unshare() == true.  GUP-fast will fallback
    to ordinary GUP.  As ordinary GUP properly considers the logical hugetlb
    PTE abstraction in hugetlb_follow_page_mask(), pinning the page will
    succeed when looking at the PageAnonExclusive on the head page only.
    
    So the only real effect of this is that with cont-PTE hugetlb pages, we'll
    always fallback from GUP-fast to ordinary GUP when not working on the head
    page, which ends up checking the head page and do the right thing.
    
    Consequently, the cow selftests pass with cont-PTE hugetlb pages as well
    without CONFIG_DEBUG_VM_PGFLAGS.
    
    Note that this only applies to anon hugetlb pages that are mapped using
    cont-PTEs: for example 64k hugetlb pages on a 4k arm64 kernel.
    
    ... and only when R/O-pinning (FOLL_PIN) such pages that are mapped into
    the page table R/O using GUP-fast.
    
    On production kernels (and even most debug kernels, that don't set
    CONFIG_DEBUG_VM_PGFLAGS) this patch should theoretically not be required
    to be backported.  But of course, it does not hurt.
    
    Link: https://lkml.kernel.org/r/20230805101256.87306-1-david@redhat.com
    Fixes: a7f226604170 ("mm/gup: trigger FAULT_FLAG_UNSHARE when R/O-pinning a possibly shared anonymous page")
    Signed-off-by: David Hildenbrand <david@redhat.com>
    Reported-by: Ryan Roberts <ryan.roberts@arm.com>
    Reviewed-by: Ryan Roberts <ryan.roberts@arm.com>
    Tested-by: Ryan Roberts <ryan.roberts@arm.com>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Cc: John Hubbard <jhubbard@nvidia.com>
    Cc: Jason Gunthorpe <jgg@nvidia.com>
    Cc: Peter Xu <peterx@redhat.com>
    Cc: Mike Kravetz <mike.kravetz@oracle.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2106dae0f19d6e7b8359f0993c49d12951548878
Author: David Hildenbrand <david@redhat.com>
Date:   Thu Aug 3 16:32:02 2023 +0200

    mm/gup: reintroduce FOLL_NUMA as FOLL_HONOR_NUMA_FAULT
    
    commit d74943a2f3cdade34e471b36f55f7979be656867 upstream.
    
    Unfortunately commit 474098edac26 ("mm/gup: replace FOLL_NUMA by
    gup_can_follow_protnone()") missed that follow_page() and
    follow_trans_huge_pmd() never implicitly set FOLL_NUMA because they really
    don't want to fail on PROT_NONE-mapped pages -- either due to NUMA hinting
    or due to inaccessible (PROT_NONE) VMAs.
    
    As spelled out in commit 0b9d705297b2 ("mm: numa: Support NUMA hinting
    page faults from gup/gup_fast"): "Other follow_page callers like KSM
    should not use FOLL_NUMA, or they would fail to get the pages if they use
    follow_page instead of get_user_pages."
    
    liubo reported [1] that smaps_rollup results are imprecise, because they
    miss accounting of pages that are mapped PROT_NONE.  Further, it's easy to
    reproduce that KSM no longer works on inaccessible VMAs on x86-64, because
    pte_protnone()/pmd_protnone() also indictaes "true" in inaccessible VMAs,
    and follow_page() refuses to return such pages right now.
    
    As KVM really depends on these NUMA hinting faults, removing the
    pte_protnone()/pmd_protnone() handling in GUP code completely is not
    really an option.
    
    To fix the issues at hand, let's revive FOLL_NUMA as FOLL_HONOR_NUMA_FAULT
    to restore the original behavior for now and add better comments.
    
    Set FOLL_HONOR_NUMA_FAULT independent of FOLL_FORCE in
    is_valid_gup_args(), to add that flag for all external GUP users.
    
    Note that there are three GUP-internal __get_user_pages() users that don't
    end up calling is_valid_gup_args() and consequently won't get
    FOLL_HONOR_NUMA_FAULT set.
    
    1) get_dump_page(): we really don't want to handle NUMA hinting
       faults. It specifies FOLL_FORCE and wouldn't have honored NUMA
       hinting faults already.
    2) populate_vma_page_range(): we really don't want to handle NUMA hinting
       faults. It specifies FOLL_FORCE on accessible VMAs, so it wouldn't have
       honored NUMA hinting faults already.
    3) faultin_vma_page_range(): we similarly don't want to handle NUMA
       hinting faults.
    
    To make the combination of FOLL_FORCE and FOLL_HONOR_NUMA_FAULT work in
    inaccessible VMAs properly, we have to perform VMA accessibility checks in
    gup_can_follow_protnone().
    
    As GUP-fast should reject such pages either way in
    pte_access_permitted()/pmd_access_permitted() -- for example on x86-64 and
    arm64 that both implement pte_protnone() -- let's just always fallback to
    ordinary GUP when stumbling over pte_protnone()/pmd_protnone().
    
    As Linus notes [2], honoring NUMA faults might only make sense for
    selected GUP users.
    
    So we should really see if we can instead let relevant GUP callers specify
    it manually, and not trigger NUMA hinting faults from GUP as default.
    Prepare for that by making FOLL_HONOR_NUMA_FAULT an external GUP flag and
    adding appropriate documenation.
    
    While at it, remove a stale comment from follow_trans_huge_pmd(): That
    comment for pmd_protnone() was added in commit 2b4847e73004 ("mm: numa:
    serialise parallel get_user_page against THP migration"), which noted:
    
            THP does not unmap pages due to a lack of support for migration
            entries at a PMD level.  This allows races with get_user_pages
    
    Nowadays, we do have PMD migration entries, so the comment no longer
    applies.  Let's drop it.
    
    [1] https://lore.kernel.org/r/20230726073409.631838-1-liubo254@huawei.com
    [2] https://lore.kernel.org/r/CAHk-=wgRiP_9X0rRdZKT8nhemZGNateMtb366t37d8-x7VRs=g@mail.gmail.com
    
    Link: https://lkml.kernel.org/r/20230803143208.383663-2-david@redhat.com
    Fixes: 474098edac26 ("mm/gup: replace FOLL_NUMA by gup_can_follow_protnone()")
    Signed-off-by: David Hildenbrand <david@redhat.com>
    Reported-by: liubo <liubo254@huawei.com>
    Closes: https://lore.kernel.org/r/20230726073409.631838-1-liubo254@huawei.com
    Reported-by: Peter Xu <peterx@redhat.com>
    Closes: https://lore.kernel.org/all/ZMKJjDaqZ7FW0jfe@x1n/
    Acked-by: Mel Gorman <mgorman@techsingularity.net>
    Acked-by: Peter Xu <peterx@redhat.com>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Jason Gunthorpe <jgg@ziepe.ca>
    Cc: John Hubbard <jhubbard@nvidia.com>
    Cc: Linus Torvalds <torvalds@linux-foundation.org>
    Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
    Cc: Mel Gorman <mgorman@suse.de>
    Cc: Paolo Bonzini <pbonzini@redhat.com>
    Cc: Shuah Khan <shuah@kernel.org>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit a55dd240a9f1de426e1113e7cd1c226daebe7f9f
Author: Suren Baghdasaryan <surenb@google.com>
Date:   Fri Aug 4 08:27:19 2023 -0700

    mm: enable page walking API to lock vmas during the walk
    
    commit 49b0638502da097c15d46cd4e871dbaa022caf7c upstream.
    
    walk_page_range() and friends often operate under write-locked mmap_lock.
    With introduction of vma locks, the vmas have to be locked as well during
    such walks to prevent concurrent page faults in these areas.  Add an
    additional member to mm_walk_ops to indicate locking requirements for the
    walk.
    
    The change ensures that page walks which prevent concurrent page faults
    by write-locking mmap_lock, operate correctly after introduction of
    per-vma locks.  With per-vma locks page faults can be handled under vma
    lock without taking mmap_lock at all, so write locking mmap_lock would
    not stop them.  The change ensures vmas are properly locked during such
    walks.
    
    A sample issue this solves is do_mbind() performing queue_pages_range()
    to queue pages for migration.  Without this change a concurrent page
    can be faulted into the area and be left out of migration.
    
    Link: https://lkml.kernel.org/r/20230804152724.3090321-2-surenb@google.com
    Signed-off-by: Suren Baghdasaryan <surenb@google.com>
    Suggested-by: Linus Torvalds <torvalds@linuxfoundation.org>
    Suggested-by: Jann Horn <jannh@google.com>
    Cc: David Hildenbrand <david@redhat.com>
    Cc: Davidlohr Bueso <dave@stgolabs.net>
    Cc: Hugh Dickins <hughd@google.com>
    Cc: Johannes Weiner <hannes@cmpxchg.org>
    Cc: Laurent Dufour <ldufour@linux.ibm.com>
    Cc: Liam Howlett <liam.howlett@oracle.com>
    Cc: Matthew Wilcox (Oracle) <willy@infradead.org>
    Cc: Michal Hocko <mhocko@suse.com>
    Cc: Michel Lespinasse <michel@lespinasse.org>
    Cc: Peter Xu <peterx@redhat.com>
    Cc: Vlastimil Babka <vbabka@suse.cz>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 2dcc0e4b3c1c085e361191ea44c567129f3ef466
Author: Ayush Jain <ayush.jain3@amd.com>
Date:   Tue Aug 8 07:43:47 2023 -0500

    selftests/mm: FOLL_LONGTERM need to be updated to 0x100
    
    commit 1738b949625c7e17a454b25de33f1f415da3db69 upstream.
    
    After commit 2c2241081f7d ("mm/gup: move private gup FOLL_ flags to
    internal.h") FOLL_LONGTERM flag value got updated from 0x10000 to 0x100 at
    include/linux/mm_types.h.
    
    As hmm.hmm_device_private.hmm_gup_test uses FOLL_LONGTERM Updating same
    here as well.
    
    Before this change test goes in an infinite assert loop in
    hmm.hmm_device_private.hmm_gup_test
    ==========================================================
     RUN           hmm.hmm_device_private.hmm_gup_test ...
    hmm-tests.c:1962:hmm_gup_test:Expected HMM_DMIRROR_PROT_WRITE..
    ..(2) == m[2] (34)
    hmm-tests.c:157:hmm_gup_test:Expected ret (-1) == 0 (0)
    hmm-tests.c:157:hmm_gup_test:Expected ret (-1) == 0 (0)
    ...
    ==========================================================
    
     Call Trace:
     <TASK>
     ? sched_clock+0xd/0x20
     ? __lock_acquire.constprop.0+0x120/0x6c0
     ? ktime_get+0x2c/0xd0
     ? sched_clock+0xd/0x20
     ? local_clock+0x12/0xd0
     ? lock_release+0x26e/0x3b0
     pin_user_pages_fast+0x4c/0x70
     gup_test_ioctl+0x4ff/0xbb0
     ? gup_test_ioctl+0x68c/0xbb0
     __x64_sys_ioctl+0x99/0xd0
     do_syscall_64+0x60/0x90
     ? syscall_exit_to_user_mode+0x2a/0x50
     ? do_syscall_64+0x6d/0x90
     ? syscall_exit_to_user_mode+0x2a/0x50
     ? do_syscall_64+0x6d/0x90
     ? irqentry_exit_to_user_mode+0xd/0x20
     ? irqentry_exit+0x3f/0x50
     ? exc_page_fault+0x96/0x200
     entry_SYSCALL_64_after_hwframe+0x72/0xdc
     RIP: 0033:0x7f6aaa31aaff
    
    After this change test is able to pass successfully.
    
    Link: https://lkml.kernel.org/r/20230808124347.79163-1-ayush.jain3@amd.com
    Fixes: 2c2241081f7d ("mm/gup: move private gup FOLL_ flags to internal.h")
    Signed-off-by: Ayush Jain <ayush.jain3@amd.com>
    Reviewed-by: Raghavendra K T <raghavendra.kt@amd.com>
    Reviewed-by: John Hubbard <jhubbard@nvidia.com>
    Acked-by: David Hildenbrand <david@redhat.com>
    Cc: Jason Gunthorpe <jgg@nvidia.com>
    Cc: <stable@vger.kernel.org>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c02c4e76ccb9a987ecc52b3dca5f5471aab7a703
Author: Takashi Iwai <tiwai@suse.de>
Date:   Wed Aug 23 18:16:25 2023 +0200

    ALSA: ymfpci: Fix the missing snd_card_free() call at probe error
    
    commit 1d0eb6143c1e85d3f9a3f5a616ee7e5dc351d33b upstream.
    
    Like a few other drivers, YMFPCI driver needs to clean up with
    snd_card_free() call at an error path of the probe; otherwise the
    other devres resources are released before the card and it results in
    the UAF.
    
    This patch uses the helper for handling the probe error gracefully.
    
    Fixes: f33fc1576757 ("ALSA: ymfpci: Create card with device-managed snd_devm_card_new()")
    Cc: <stable@vger.kernel.org>
    Reported-and-tested-by: Takashi Yano <takashi.yano@nifty.ne.jp>
    Closes: https://lore.kernel.org/r/20230823135846.1812-1-takashi.yano@nifty.ne.jp
    Link: https://lore.kernel.org/r/20230823161625.5807-1-tiwai@suse.de
    Signed-off-by: Takashi Iwai <tiwai@suse.de>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 6218f967d579cd38a1126f6246aad2d19b56088a
Author: Hugh Dickins <hughd@google.com>
Date:   Tue Aug 22 22:14:47 2023 -0700

    shmem: fix smaps BUG sleeping while atomic
    
    commit e5548f85b4527c4c803b7eae7887c10bf8f90c97 upstream.
    
    smaps_pte_hole_lookup() is calling shmem_partial_swap_usage() with page
    table lock held: but shmem_partial_swap_usage() does cond_resched_rcu() if
    need_resched(): "BUG: sleeping function called from invalid context".
    
    Since shmem_partial_swap_usage() is designed to count across a range, but
    smaps_pte_hole_lookup() only calls it for a single page slot, just break
    out of the loop on the last or only page, before checking need_resched().
    
    Link: https://lkml.kernel.org/r/6fe3b3ec-abdf-332f-5c23-6a3b3a3b11a9@google.com
    Fixes: 230100321518 ("mm/smaps: simplify shmem handling of pte holes")
    Signed-off-by: Hugh Dickins <hughd@google.com>
    Acked-by: Peter Xu <peterx@redhat.com>
    Cc: <stable@vger.kernel.org>    [5.16+]
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 535cdce0713e7a8676b79601edbf0e5be16a4f85
Author: Rik van Riel <riel@surriel.com>
Date:   Thu Aug 17 13:57:59 2023 -0400

    mm,ima,kexec,of: use memblock_free_late from ima_free_kexec_buffer
    
    commit f0362a253606e2031f8d61c74195d4d6556e12a4 upstream.
    
    The code calling ima_free_kexec_buffer runs long after the memblock
    allocator has already been torn down, potentially resulting in a use
    after free in memblock_isolate_range.
    
    With KASAN or KFENCE, this use after free will result in a BUG
    from the idle task, and a subsequent kernel panic.
    
    Switch ima_free_kexec_buffer over to memblock_free_late to avoid
    that issue.
    
    Fixes: fee3ff99bc67 ("powerpc: Move arch independent ima kexec functions to drivers/of/kexec.c")
    Cc: stable@kernel.org
    Signed-off-by: Rik van Riel <riel@surriel.com>
    Suggested-by: Mike Rappoport <rppt@kernel.org>
    Link: https://lore.kernel.org/r/20230817135759.0888e5ef@imladris.surriel.com
    Signed-off-by: Rob Herring <robh@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit c856ff4acd94fceb745982c9c0e8bcc8e421efd5
Author: Andrey Skvortsov <andrej.skvortzov@gmail.com>
Date:   Sat Aug 5 11:48:47 2023 +0300

    clk: Fix slab-out-of-bounds error in devm_clk_release()
    
    commit 66fbfb35da47f391bdadf9fa7ceb88af4faa9022 upstream.
    
    Problem can be reproduced by unloading snd_soc_simple_card, because in
    devm_get_clk_from_child() devres data is allocated as `struct clk`, but
    devm_clk_release() expects devres data to be `struct devm_clk_state`.
    
    KASAN report:
     ==================================================================
     BUG: KASAN: slab-out-of-bounds in devm_clk_release+0x20/0x54
     Read of size 8 at addr ffffff800ee09688 by task (udev-worker)/287
    
     Call trace:
      dump_backtrace+0xe8/0x11c
      show_stack+0x1c/0x30
      dump_stack_lvl+0x60/0x78
      print_report+0x150/0x450
      kasan_report+0xa8/0xf0
      __asan_load8+0x78/0xa0
      devm_clk_release+0x20/0x54
      release_nodes+0x84/0x120
      devres_release_all+0x144/0x210
      device_unbind_cleanup+0x1c/0xac
      really_probe+0x2f0/0x5b0
      __driver_probe_device+0xc0/0x1f0
      driver_probe_device+0x68/0x120
      __driver_attach+0x140/0x294
      bus_for_each_dev+0xec/0x160
      driver_attach+0x38/0x44
      bus_add_driver+0x24c/0x300
      driver_register+0xf0/0x210
      __platform_driver_register+0x48/0x54
      asoc_simple_card_init+0x24/0x1000 [snd_soc_simple_card]
      do_one_initcall+0xac/0x340
      do_init_module+0xd0/0x300
      load_module+0x2ba4/0x3100
      __do_sys_init_module+0x2c8/0x300
      __arm64_sys_init_module+0x48/0x5c
      invoke_syscall+0x64/0x190
      el0_svc_common.constprop.0+0x124/0x154
      do_el0_svc+0x44/0xdc
      el0_svc+0x14/0x50
      el0t_64_sync_handler+0xec/0x11c
      el0t_64_sync+0x14c/0x150
    
     Allocated by task 287:
      kasan_save_stack+0x38/0x60
      kasan_set_track+0x28/0x40
      kasan_save_alloc_info+0x20/0x30
      __kasan_kmalloc+0xac/0xb0
      __kmalloc_node_track_caller+0x6c/0x1c4
      __devres_alloc_node+0x44/0xb4
      devm_get_clk_from_child+0x44/0xa0
      asoc_simple_parse_clk+0x1b8/0x1dc [snd_soc_simple_card_utils]
      simple_parse_node.isra.0+0x1ec/0x230 [snd_soc_simple_card]
      simple_dai_link_of+0x1bc/0x334 [snd_soc_simple_card]
      __simple_for_each_link+0x2ec/0x320 [snd_soc_simple_card]
      asoc_simple_probe+0x468/0x4dc [snd_soc_simple_card]
      platform_probe+0x90/0xf0
      really_probe+0x118/0x5b0
      __driver_probe_device+0xc0/0x1f0
      driver_probe_device+0x68/0x120
      __driver_attach+0x140/0x294
      bus_for_each_dev+0xec/0x160
      driver_attach+0x38/0x44
      bus_add_driver+0x24c/0x300
      driver_register+0xf0/0x210
      __platform_driver_register+0x48/0x54
      asoc_simple_card_init+0x24/0x1000 [snd_soc_simple_card]
      do_one_initcall+0xac/0x340
      do_init_module+0xd0/0x300
      load_module+0x2ba4/0x3100
      __do_sys_init_module+0x2c8/0x300
      __arm64_sys_init_module+0x48/0x5c
      invoke_syscall+0x64/0x190
      el0_svc_common.constprop.0+0x124/0x154
      do_el0_svc+0x44/0xdc
      el0_svc+0x14/0x50
      el0t_64_sync_handler+0xec/0x11c
      el0t_64_sync+0x14c/0x150
    
     The buggy address belongs to the object at ffffff800ee09600
      which belongs to the cache kmalloc-256 of size 256
     The buggy address is located 136 bytes inside of
      256-byte region [ffffff800ee09600, ffffff800ee09700)
    
     The buggy address belongs to the physical page:
     page:000000002d97303b refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x4ee08
     head:000000002d97303b order:1 compound_mapcount:0 compound_pincount:0
     flags: 0x10200(slab|head|zone=0)
     raw: 0000000000010200 0000000000000000 dead000000000122 ffffff8002c02480
     raw: 0000000000000000 0000000080100010 00000001ffffffff 0000000000000000
     page dumped because: kasan: bad access detected
    
     Memory state around the buggy address:
      ffffff800ee09580: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
      ffffff800ee09600: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
     >ffffff800ee09680: 00 fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
                           ^
      ffffff800ee09700: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
      ffffff800ee09780: fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc fc
     ==================================================================
    
    Fixes: abae8e57e49a ("clk: generalize devm_clk_get() a bit")
    Signed-off-by: Andrey Skvortsov <andrej.skvortzov@gmail.com>
    Link: https://lore.kernel.org/r/20230805084847.3110586-1-andrej.skvortzov@gmail.com
    Signed-off-by: Stephen Boyd <sboyd@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit ed2e9e10a1305f041f3434d9965b683e5bd883fc
Author: Benjamin Coddington <bcodding@redhat.com>
Date:   Fri Jun 30 09:18:13 2023 -0400

    NFSv4: Fix dropped lock for racing OPEN and delegation return
    
    commit 1cbc11aaa01f80577b67ae02c73ee781112125fd upstream.
    
    Commmit f5ea16137a3f ("NFSv4: Retry LOCK on OLD_STATEID during delegation
    return") attempted to solve this problem by using nfs4's generic async error
    handling, but introduced a regression where v4.0 lock recovery would hang.
    The additional complexity introduced by overloading that error handling is
    not necessary for this case.  This patch expects that commit to be
    reverted.
    
    The problem as originally explained in the above commit is:
    
        There's a small window where a LOCK sent during a delegation return can
        race with another OPEN on client, but the open stateid has not yet been
        updated.  In this case, the client doesn't handle the OLD_STATEID error
        from the server and will lose this lock, emitting:
        "NFS: nfs4_handle_delegation_recall_error: unhandled error -10024".
    
    Fix this by using the old_stateid refresh helpers if the server replies
    with OLD_STATEID.
    
    Suggested-by: Trond Myklebust <trondmy@hammerspace.com>
    Signed-off-by: Benjamin Coddington <bcodding@redhat.com>
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit ed29b5fbf07f1d3f11398611816d2dd7bdeda0a0
Author: André Apitzsch <git@apitzsch.eu>
Date:   Sat Aug 19 09:12:15 2023 +0200

    platform/x86: ideapad-laptop: Add support for new hotkeys found on ThinkBook 14s Yoga ITL
    
    commit a260f7d726fde52c0278bd3fa085a758639bcee2 upstream.
    
    The Lenovo Thinkbook 14s Yoga ITL has 4 new symbols/shortcuts on their
    F9-F11 and PrtSc keys:
    
    F9:    Has a symbol of a head with a headset, the manual says "Service key"
    F10:   Has a symbol of a telephone horn which has been picked up from the
           receiver, the manual says: "Answer incoming calls"
    F11:   Has a symbol of a telephone horn which is resting on the receiver,
           the manual says: "Reject incoming calls"
    PrtSc: Has a symbol of a siccor and a dashed ellipse, the manual says:
           "Open the Windows 'Snipping' Tool app"
    
    This commit adds support for these 4 new hkey events.
    
    Signed-off-by: André Apitzsch <git@apitzsch.eu>
    Link: https://lore.kernel.org/r/20230819-lenovo_keys-v1-1-9d34eac88e0a@apitzsch.eu
    Reviewed-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 3bdeb65ca9c8272e1308a7d2a4aa25506fc11013
Author: Swapnil Devesh <me@sidevesh.com>
Date:   Fri Aug 18 18:09:47 2023 +0530

    platform/x86: lenovo-ymc: Add Lenovo Yoga 7 14ACN6 to ec_trigger_quirk_dmi_table
    
    commit db35610a181c18f7a521a2e157f7acdef7ce425f upstream.
    
    This adds my laptop Lenovo Yoga 7 14ACN6, with Product Name: 82N7
    (from `dmidecode -t1 | grep "Product Name"`) to
    the ec_trigger_quirk_dmi_table, have tested that this is required
    for the YMC driver to work correctly on this model.
    
    Signed-off-by: Swapnil Devesh <me@sidevesh.com>
    Reviewed-by: Gergő Köteles <soyer@irl.hu>
    Link: https://lore.kernel.org/r/18a08a8b173.895ef3b250414.1213194126082324071@sidevesh.com
    Reviewed-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Hans de Goede <hdegoede@redhat.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 28eee9b4e819a56fbd0cadf5e9cc4d43e54d0d35
Author: Ping-Ke Shih <pkshih@realtek.com>
Date:   Fri Aug 18 09:40:04 2023 +0800

    wifi: mac80211: limit reorder_buf_filtered to avoid UBSAN warning
    
    commit b98c16107cc1647242abbd11f234c05a3a5864f6 upstream.
    
    The commit 06470f7468c8 ("mac80211: add API to allow filtering frames in BA sessions")
    added reorder_buf_filtered to mark frames filtered by firmware, and it
    can only work correctly if hw.max_rx_aggregation_subframes <= 64 since
    it stores the bitmap in a u64 variable.
    
    However, new HE or EHT devices can support BlockAck number up to 256 or
    1024, and then using a higher subframe index leads UBSAN warning:
    
     UBSAN: shift-out-of-bounds in net/mac80211/rx.c:1129:39
     shift exponent 215 is too large for 64-bit type 'long long unsigned int'
     Call Trace:
      <IRQ>
      dump_stack_lvl+0x48/0x70
      dump_stack+0x10/0x20
      __ubsan_handle_shift_out_of_bounds+0x1ac/0x360
      ieee80211_release_reorder_frame.constprop.0.cold+0x64/0x69 [mac80211]
      ieee80211_sta_reorder_release+0x9c/0x400 [mac80211]
      ieee80211_prepare_and_rx_handle+0x1234/0x1420 [mac80211]
      ieee80211_rx_list+0xaef/0xf60 [mac80211]
      ieee80211_rx_napi+0x53/0xd0 [mac80211]
    
    Since only old hardware that supports <=64 BlockAck uses
    ieee80211_mark_rx_ba_filtered_frames(), limit the use as it is, so add a
    WARN_ONCE() and comment to note to avoid using this function if hardware
    capability is not suitable.
    
    Signed-off-by: Ping-Ke Shih <pkshih@realtek.com>
    Link: https://lore.kernel.org/r/20230818014004.16177-1-pkshih@realtek.com
    [edit commit message]
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit a3009e19f09baf3264975d469a2d63d48855bb8a
Author: Michael Ellerman <mpe@ellerman.id.au>
Date:   Wed Aug 23 14:51:39 2023 +1000

    ibmveth: Use dcbf rather than dcbfl
    
    commit bfedba3b2c7793ce127680bc8f70711e05ec7a17 upstream.
    
    When building for power4, newer binutils don't recognise the "dcbfl"
    extended mnemonic.
    
    dcbfl RA, RB is equivalent to dcbf RA, RB, 1.
    
    Switch to "dcbf" to avoid the build error.
    
    Signed-off-by: Michael Ellerman <mpe@ellerman.id.au>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 06a128cbe40e1b14a8e52b05429e234b7ec89dd8
Author: Srinivas Goud <srinivas.goud@amd.com>
Date:   Mon Aug 21 15:00:16 2023 +0530

    spi: spi-cadence: Fix data corruption issues in slave mode
    
    commit 627d05a41ca1fbb9d390f9513af262f001f261f7 upstream.
    
    Remove 10us delay in cdns_spi_process_fifo() (called from cdns_spi_irq())
    to fix data corruption issue on Master side when this driver
    configured in Slave mode, as Slave is failed to prepare the date
    on time due to above delay.
    
    Add 10us delay before processing the RX FIFO as TX empty doesn't
    guarantee valid data in RX FIFO.
    
    Signed-off-by: Srinivas Goud <srinivas.goud@amd.com>
    Reviewed-by: Charles Keepax <ckeepax@opensource.cirrus.com>
    Tested-by: Charles Keepax <ckeepax@opensource.cirrus.com>
    Link: https://lore.kernel.org/r/1692610216-217644-1-git-send-email-srinivas.goud@amd.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 28b605e939b3046177b869943dea4c1acf19acbc
Author: Charles Keepax <ckeepax@opensource.cirrus.com>
Date:   Wed Aug 23 09:53:08 2023 +0100

    ASoC: cs35l41: Correct amp_gain_tlv values
    
    commit 1613781d7e8a93618ff3a6b37f81f06769b53717 upstream.
    
    The current analog gain TLV seems to have completely incorrect values in
    it. The gain starts at 0.5dB, proceeds in 1dB steps, and has no mute
    value, correct the control to match.
    
    Signed-off-by: Charles Keepax <ckeepax@opensource.cirrus.com>
    Link: https://lore.kernel.org/r/20230823085308.753572-1-ckeepax@opensource.cirrus.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 8c7fd1baeed017c05f248ba69a8d6ec45f3bb5e1
Author: BrenoRCBrito <brenorcbrito@gmail.com>
Date:   Fri Aug 18 18:14:16 2023 -0300

    ASoC: amd: yc: Add VivoBook Pro 15 to quirks list for acp6x
    
    commit 3b1f08833c45d0167741e4097b0150e7cf086102 upstream.
    
    VivoBook Pro 15 Ryzen Edition uses Ryzen 6800H processor, and adding to
     quirks list for acp6x will enable internal mic.
    
    Signed-off-by: BrenoRCBrito <brenorcbrito@gmail.com>
    Link: https://lore.kernel.org/r/20230818211417.32167-1-brenorcbrito@gmail.com
    Signed-off-by: Mark Brown <broonie@kernel.org>
    Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>

commit 1cc2d968504311c5e02744b9bccbc7e803ccb049
Author: Hangbin Liu <liuhangbin@gmail.com>
Date:   Wed Aug 23 15:19:04 2023 +0800

    bonding: fix macvlan over alb bond support
    
    [ Upstream commit e74216b8def3803e98ae536de78733e9d7f3b109 ]
    
    The commit 14af9963ba1e ("bonding: Support macvlans on top of tlb/rlb mode
    bonds") aims to enable the use of macvlans on top of rlb bond mode. However,
    the current rlb bond mode only handles ARP packets to update remote neighbor
    entries. This causes an issue when a macvlan is on top of the bond, and
    remote devices send packets to the macvlan using the bond's MAC address
    as the destination. After delivering the packets to the macvlan, the macvlan
    will rejects them as the MAC address is incorrect. Consequently, this commit
    makes macvlan over bond non-functional.
    
    To address this problem, one potential solution is to check for the presence
    of a macvlan port on the bond device using netif_is_macvlan_port(bond->dev)
    and return NULL in the rlb_arp_xmit() function. However, this approach
    doesn't fully resolve the situation when a VLAN exists between the bond and
    macvlan.
    
    So let's just do a partial revert for commit 14af9963ba1e in rlb_arp_xmit().
    As the comment said, Don't modify or load balance ARPs that do not originate
    locally.
    
    Fixes: 14af9963ba1e ("bonding: Support macvlans on top of tlb/rlb mode bonds")
    Reported-by: susan.zheng@veritas.com
    Closes: https://bugzilla.redhat.com/show_bug.cgi?id=2117816
    Signed-off-by: Hangbin Liu <liuhangbin@gmail.com>
    Acked-by: Jay Vosburgh <jay.vosburgh@canonical.com>
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit abdf60d759f7236459049bfc4b7a1f0be654cead
Author: Ido Schimmel <idosch@nvidia.com>
Date:   Wed Aug 23 09:43:48 2023 +0300

    rtnetlink: Reject negative ifindexes in RTM_NEWLINK
    
    [ Upstream commit 30188bd7838c16a98a520db1fe9df01ffc6ed368 ]
    
    Negative ifindexes are illegal, but the kernel does not validate the
    ifindex in the ancillary header of RTM_NEWLINK messages, resulting in
    the kernel generating a warning [1] when such an ifindex is specified.
    
    Fix by rejecting negative ifindexes.
    
    [1]
    WARNING: CPU: 0 PID: 5031 at net/core/dev.c:9593 dev_index_reserve+0x1a2/0x1c0 net/core/dev.c:9593
    [...]
    Call Trace:
     <TASK>
     register_netdevice+0x69a/0x1490 net/core/dev.c:10081
     br_dev_newlink+0x27/0x110 net/bridge/br_netlink.c:1552
     rtnl_newlink_create net/core/rtnetlink.c:3471 [inline]
     __rtnl_newlink+0x115e/0x18c0 net/core/rtnetlink.c:3688
     rtnl_newlink+0x67/0xa0 net/core/rtnetlink.c:3701
     rtnetlink_rcv_msg+0x439/0xd30 net/core/rtnetlink.c:6427
     netlink_rcv_skb+0x16b/0x440 net/netlink/af_netlink.c:2545
     netlink_unicast_kernel net/netlink/af_netlink.c:1342 [inline]
     netlink_unicast+0x536/0x810 net/netlink/af_netlink.c:1368
     netlink_sendmsg+0x93c/0xe40 net/netlink/af_netlink.c:1910
     sock_sendmsg_nosec net/socket.c:728 [inline]
     sock_sendmsg+0xd9/0x180 net/socket.c:751
     ____sys_sendmsg+0x6ac/0x940 net/socket.c:2538
     ___sys_sendmsg+0x135/0x1d0 net/socket.c:2592
     __sys_sendmsg+0x117/0x1e0 net/socket.c:2621
     do_syscall_x64 arch/x86/entry/common.c:50 [inline]
     do_syscall_64+0x38/0xb0 arch/x86/entry/common.c:80
     entry_SYSCALL_64_after_hwframe+0x63/0xcd
    
    Fixes: 38f7b870d4a6 ("[RTNETLINK]: Link creation API")
    Reported-by: syzbot+5ba06978f34abb058571@syzkaller.appspotmail.com
    Signed-off-by: Ido Schimmel <idosch@nvidia.com>
    Reviewed-by: Jiri Pirko <jiri@nvidia.com>
    Reviewed-by: Jakub Kicinski <kuba@kernel.org>
    Link: https://lore.kernel.org/r/20230823064348.2252280-1-idosch@nvidia.com
    Signed-off-by: Paolo Abeni <pabeni@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit beceaf2e5e337d1ffff2818e0e291e23bd456a5e
Author: Florian Westphal <fw@strlen.de>
Date:   Tue Aug 22 22:03:57 2023 +0200

    netfilter: nf_tables: defer gc run if previous batch is still pending
    
    [ Upstream commit 8e51830e29e12670b4c10df070a4ea4c9593e961 ]
    
    Don't queue more gc work, else we may queue the same elements multiple
    times.
    
    If an element is flagged as dead, this can mean that either the previous
    gc request was invalidated/discarded by a transaction or that the previous
    request is still pending in the system work queue.
    
    The latter will happen if the gc interval is set to a very low value,
    e.g. 1ms, and system work queue is backlogged.
    
    The sets refcount is 1 if no previous gc requeusts are queued, so add
    a helper for this and skip gc run if old requests are pending.
    
    Add a helper for this and skip the gc run in this case.
    
    Fixes: f6c383b8c31a ("netfilter: nf_tables: adapt set backend to use GC transaction API")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Reviewed-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 16cc42cc00fb263c3e4b81b3ec67c51ce685bf4c
Author: Florian Westphal <fw@strlen.de>
Date:   Tue Aug 22 19:49:52 2023 +0200

    netfilter: nf_tables: fix out of memory error handling
    
    [ Upstream commit 5e1be4cdc98c989d5387ce94ff15b5ad06a5b681 ]
    
    Several instances of pipapo_resize() don't propagate allocation failures,
    this causes a crash when fault injection is enabled for gfp_kernel slabs.
    
    Fixes: 3c4287f62044 ("nf_tables: Add set type for arbitrary concatenation of ranges")
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Reviewed-by: Stefano Brivio <sbrivio@redhat.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit e05b2a9f03b32ae0125e3d7b9b76d55292991587
Author: Pablo Neira Ayuso <pablo@netfilter.org>
Date:   Mon Aug 21 14:33:32 2023 +0200

    netfilter: nf_tables: use correct lock to protect gc_list
    
    [ Upstream commit 8357bc946a2abc2a10ca40e5a2105d2b4c57515e ]
    
    Use nf_tables_gc_list_lock spinlock, not nf_tables_destroy_list_lock to
    protect the gc list.
    
    Fixes: 5f68718b34a5 ("netfilter: nf_tables: GC transaction API to avoid race with control plane")
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit e07e68823116563bdbc49cef185cda6f463bc534
Author: Pablo Neira Ayuso <pablo@netfilter.org>
Date:   Fri Aug 18 01:13:52 2023 +0200

    netfilter: nf_tables: GC transaction race with abort path
    
    [ Upstream commit 720344340fb9be2765bbaab7b292ece0a4570eae ]
    
    Abort path is missing a synchronization point with GC transactions. Add
    GC sequence number hence any GC transaction losing race will be
    discarded.
    
    Fixes: 5f68718b34a5 ("netfilter: nf_tables: GC transaction API to avoid race with control plane")
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 4167aa477abcf62b0dfda51f3513280fa73cd588
Author: Pablo Neira Ayuso <pablo@netfilter.org>
Date:   Fri Aug 18 01:13:31 2023 +0200

    netfilter: nf_tables: flush pending destroy work before netlink notifier
    
    [ Upstream commit 2c9f0293280e258606e54ed2b96fa71498432eae ]
    
    Destroy work waits for the RCU grace period then it releases the objects
    with no mutex held. All releases objects follow this path for
    transactions, therefore, order is guaranteed and references to top-level
    objects in the hierarchy remain valid.
    
    However, netlink notifier might interfer with pending destroy work.
    rcu_barrier() is not correct because objects are not release via RCU
    callback. Flush destroy work before releasing objects from netlink
    notifier path.
    
    Fixes: d4bc8271db21 ("netfilter: nf_tables: netlink notifier might race to release objects")
    Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit e290509f8be5cb50cb9ccddc11c28db6016ce25e
Author: Florian Westphal <fw@strlen.de>
Date:   Thu Aug 17 20:28:32 2023 +0200

    netfilter: nf_tables: validate all pending tables
    
    [ Upstream commit 4b80ced971b0d118f9a11dd503a5833a5016de92 ]
    
    We have to validate all tables in the transaction that are in
    VALIDATE_DO state, the blamed commit below did not move the break
    statement to its right location so we only validate one table.
    
    Moreover, we can't init table->validate to _SKIP when a table object
    is allocated.
    
    If we do, then if a transcaction creates a new table and then
    fails the transaction, nfnetlink will loop and nft will hang until
    user cancels the command.
    
    Add back the pernet state as a place to stash the last state encountered.
    This is either _DO (we hit an error during commit validation) or _SKIP
    (transaction passed all checks).
    
    Fixes: 00c320f9b755 ("netfilter: nf_tables: make validation state per table")
    Reported-by: Pablo Neira Ayuso <pablo@netfilter.org>
    Signed-off-by: Florian Westphal <fw@strlen.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 711ffb6fa5a0a69a5c007e7b683c8aaa9b1cb6de
Author: Andrii Staikov <andrii.staikov@intel.com>
Date:   Tue Aug 22 15:16:53 2023 -0700

    i40e: fix potential NULL pointer dereferencing of pf->vf i40e_sync_vsi_filters()
    
    [ Upstream commit 9525a3c38accd2e186f52443e35e633e296cc7f5 ]
    
    Add check for pf->vf not being NULL before dereferencing
    pf->vf[vsi->vf_id] in updating VSI filter sync.
    Add a similar check before dereferencing !pf->vf[vsi->vf_id].trusted
    in the condition for clearing promisc mode bit.
    
    Fixes: c87c938f62d8 ("i40e: Add VF VLAN pruning")
    Signed-off-by: Andrii Staikov <andrii.staikov@intel.com>
    Signed-off-by: Aleksandr Loktionov <aleksandr.loktionov@intel.com>
    Tested-by: Rafal Romanowski <rafal.romanowski@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 7ac409385e1ca746b1191415a1e5afa8b5f9b0fc
Author: Jamal Hadi Salim <jhs@mojatatu.com>
Date:   Tue Aug 22 06:12:31 2023 -0400

    net/sched: fix a qdisc modification with ambiguous command request
    
    [ Upstream commit da71714e359b64bd7aab3bd56ec53f307f058133 ]
    
    When replacing an existing root qdisc, with one that is of the same kind, the
    request boils down to essentially a parameterization change  i.e not one that
    requires allocation and grafting of a new qdisc. syzbot was able to create a
    scenario which resulted in a taprio qdisc replacing an existing taprio qdisc
    with a combination of NLM_F_CREATE, NLM_F_REPLACE and NLM_F_EXCL leading to
    create and graft scenario.
    The fix ensures that only when the qdisc kinds are different that we should
    allow a create and graft, otherwise it goes into the "change" codepath.
    
    While at it, fix the code and comments to improve readability.
    
    While syzbot was able to create the issue, it did not zone on the root cause.
    Analysis from Vladimir Oltean <vladimir.oltean@nxp.com> helped narrow it down.
    
    v1->V2 changes:
    - remove "inline" function definition (Vladmir)
    - remove extrenous braces in branches (Vladmir)
    - change inline function names (Pedro)
    - Run tdc tests (Victor)
    v2->v3 changes:
    - dont break else/if (Simon)
    
    Fixes: 1da177e4c3f4 ("Linux-2.6.12-rc2")
    Reported-by: syzbot+a3618a167af2021433cd@syzkaller.appspotmail.com
    Closes: https://lore.kernel.org/netdev/20230816225759.g25x76kmgzya2gei@skbuf/T/
    Tested-by: Vladimir Oltean <vladimir.oltean@nxp.com>
    Tested-by: Victor Nogueira <victor@mojatatu.com>
    Reviewed-by: Pedro Tammela <pctammela@mojatatu.com>
    Reviewed-by: Victor Nogueira <victor@mojatatu.com>
    Signed-off-by: Jamal Hadi Salim <jhs@mojatatu.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 0717a95ba5ca3f6ec1e7e211659ee734eb257591
Author: Sasha Neftin <sasha.neftin@intel.com>
Date:   Mon Aug 21 10:17:21 2023 -0700

    igc: Fix the typo in the PTM Control macro
    
    [ Upstream commit de43975721b97283d5f17eea4228faddf08f2681 ]
    
    The IGC_PTM_CTRL_SHRT_CYC defines the time between two consecutive PTM
    requests. The bit resolution of this field is six bits. That bit five was
    missing in the mask. This patch comes to correct the typo in the
    IGC_PTM_CTRL_SHRT_CYC macro.
    
    Fixes: a90ec8483732 ("igc: Add support for PTP getcrosststamp()")
    Signed-off-by: Sasha Neftin <sasha.neftin@intel.com>
    Tested-by: Naama Meir <naamax.meir@linux.intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Kalesh AP <kalesh-anakkur.purayil@broadcom.com>
    Link: https://lore.kernel.org/r/20230821171721.2203572-1-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 8fe9d54f7ad486012fe31ba5ac9ecc39365fd0bc
Author: Alessio Igor Bogani <alessio.bogani@elettra.eu>
Date:   Mon Aug 21 10:19:27 2023 -0700

    igb: Avoid starting unnecessary workqueues
    
    [ Upstream commit b888c510f7b3d64ca75fc0f43b4a4bd1a611312f ]
    
    If ptp_clock_register() fails or CONFIG_PTP isn't enabled, avoid starting
    PTP related workqueues.
    
    In this way we can fix this:
     BUG: unable to handle page fault for address: ffffc9000440b6f8
     #PF: supervisor read access in kernel mode
     #PF: error_code(0x0000) - not-present page
     PGD 100000067 P4D 100000067 PUD 1001e0067 PMD 107dc5067 PTE 0
     Oops: 0000 [#1] PREEMPT SMP
     [...]
     Workqueue: events igb_ptp_overflow_check
     RIP: 0010:igb_rd32+0x1f/0x60
     [...]
     Call Trace:
      igb_ptp_read_82580+0x20/0x50
      timecounter_read+0x15/0x60
      igb_ptp_overflow_check+0x1a/0x50
      process_one_work+0x1cb/0x3c0
      worker_thread+0x53/0x3f0
      ? rescuer_thread+0x370/0x370
      kthread+0x142/0x160
      ? kthread_associate_blkcg+0xc0/0xc0
      ret_from_fork+0x1f/0x30
    
    Fixes: 1f6e8178d685 ("igb: Prevent dropped Tx timestamps via work items and interrupts.")
    Fixes: d339b1331616 ("igb: add PTP Hardware Clock code")
    Signed-off-by: Alessio Igor Bogani <alessio.bogani@elettra.eu>
    Tested-by: Arpana Arland <arpanax.arland@intel.com> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/20230821171927.2203644-1-anthony.l.nguyen@intel.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit ecebc084136273fcbed7238d6e361df37a01ef6e
Author: Oliver Hartkopp <socketcan@hartkopp.net>
Date:   Mon Aug 21 16:45:46 2023 +0200

    can: isotp: fix support for transmission of SF without flow control
    
    [ Upstream commit 0bfe71159230bab79ee230225ae12ffecbb69f3e ]
    
    The original implementation had a very simple handling for single frame
    transmissions as it just sent the single frame without a timeout handling.
    
    With the new echo frame handling the echo frame was also introduced for
    single frames but the former exception ('simple without timers') has been
    maintained by accident. This leads to a 1 second timeout when closing the
    socket and to an -ECOMM error when CAN_ISOTP_WAIT_TX_DONE is selected.
    
    As the echo handling is always active (also for single frames) remove the
    wrong extra condition for single frames.
    
    Fixes: 9f39d36530e5 ("can: isotp: add support for transmission without flow control")
    Signed-off-by: Oliver Hartkopp <socketcan@hartkopp.net>
    Link: https://lore.kernel.org/r/20230821144547.6658-2-socketcan@hartkopp.net
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 65009906dff2be724d872a4f472fb203cbe8fc8c
Author: Daniel Golle <daniel@makrotopia.org>
Date:   Mon Aug 21 17:12:44 2023 +0100

    net: ethernet: mtk_eth_soc: fix NULL pointer on hw reset
    
    [ Upstream commit 604204fcb321abe81238551936ecda5269e81076 ]
    
    When a hardware reset is triggered on devices not initializing WED the
    calls to mtk_wed_fe_reset and mtk_wed_fe_reset_complete dereference a
    pointer on uninitialized stack memory.
    Break out of both functions in case a hw_list entry is 0.
    
    Fixes: 08a764a7c51b ("net: ethernet: mtk_wed: add reset/reset_complete callbacks")
    Signed-off-by: Daniel Golle <daniel@makrotopia.org>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Acked-by: Lorenzo Bianconi <lorenzo@kernel.org>
    Link: https://lore.kernel.org/r/5465c1609b464cc7407ae1530c40821dcdf9d3e6.1692634266.git.daniel@makrotopia.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit d56f8304bcc476fe1524ac7e9a74f7b30bf10ab5
Author: Kees Cook <keescook@chromium.org>
Date:   Fri Aug 18 10:54:21 2023 -0700

    tg3: Use slab_build_skb() when needed
    
    [ Upstream commit 99b415fe8986803ba0eaf6b8897b16edc8fe7ec2 ]
    
    The tg3 driver will use kmalloc() under some conditions. Check the
    frag_size and use slab_build_skb() when frag_size is 0. Silences
    the warning introduced by commit ce098da1497c ("skbuff: Introduce
    slab_build_skb()"):
    
            Use slab_build_skb() instead
            ...
            tg3_poll_work+0x638/0xf90 [tg3]
    
    Fixes: ce098da1497c ("skbuff: Introduce slab_build_skb()")
    Reported-by: Fiona Ebner <f.ebner@proxmox.com>
    Closes: https://lore.kernel.org/all/1bd4cb9c-4eb8-3bdb-3e05-8689817242d1@proxmox.com
    Cc: Siva Reddy Kallam <siva.kallam@broadcom.com>
    Cc: Prashant Sreedharan <prashant@broadcom.com>
    Cc: Michael Chan <mchan@broadcom.com>
    Cc: Bagas Sanjaya <bagasdotme@gmail.com>
    Signed-off-by: Kees Cook <keescook@chromium.org>
    Reviewed-by: Pavan Chebbi <pavan.chebbi@broadcom.com>
    Link: https://lore.kernel.org/r/20230818175417.never.273-kees@kernel.org
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit be7d58c9a203d1aa68451ff624e639e7a3807bcb
Author: Hangbin Liu <liuhangbin@gmail.com>
Date:   Thu Aug 17 16:24:59 2023 +0800

    selftests: bonding: do not set port down before adding to bond
    
    [ Upstream commit be809424659c2844a2d7ab653aacca4898538023 ]
    
    Before adding a port to bond, it need to be set down first. In the
    lacpdu test the author set the port down specifically. But commit
    a4abfa627c38 ("net: rtnetlink: Enslave device before bringing it up")
    changed the operation order, the kernel will set the port down _after_
    adding to bond. So all the ports will be down at last and the test failed.
    
    In fact, the veth interfaces are already inactive when added. This
    means there's no need to set them down again before adding to the bond.
    Let's just remove the link down operation.
    
    Fixes: a4abfa627c38 ("net: rtnetlink: Enslave device before bringing it up")
    Reported-by: Zhengchao Shao <shaozhengchao@huawei.com>
    Closes: https://lore.kernel.org/netdev/a0ef07c7-91b0-94bd-240d-944a330fcabd@huawei.com/
    Signed-off-by: Hangbin Liu <liuhangbin@gmail.com>
    Link: https://lore.kernel.org/r/20230817082459.1685972-1-liuhangbin@gmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit b995365bbdd8587dc67954e19c65ef8d82f15b69
Author: Petr Oros <poros@redhat.com>
Date:   Fri Aug 11 10:07:02 2023 +0200

    ice: Fix NULL pointer deref during VF reset
    
    [ Upstream commit 67f6317dfa609846a227a706532439a22828c24b ]
    
    During stress test with attaching and detaching VF from KVM and
    simultaneously changing VFs spoofcheck and trust there was a
    NULL pointer dereference in ice_reset_vf that VF's VSI is null.
    
    More than one instance of ice_reset_vf() can be running at a given
    time. When we rebuild the VSI in ice_reset_vf, another reset can be
    triaged from ice_service_task. In this case we can access the currently
    uninitialized VSI and cause panic. The window for this racing condition
    has been around for a long time but it's much worse after commit
    227bf4500aaa ("ice: move VSI delete outside deconfig") because
    the reset runs faster. ice_reset_vf() using vf->cfg_lock and when
    we move this lock before accessing to the VF VSI, we can fix
    BUG for all cases.
    
    Panic occurs sometimes in ice_vsi_is_rx_queue_active() and sometimes
    in ice_vsi_stop_all_rx_rings()
    
    With our reproducer, we can hit BUG:
    ~8h before commit 227bf4500aaa ("ice: move VSI delete outside deconfig").
    ~20m after commit 227bf4500aaa ("ice: move VSI delete outside deconfig").
    After this fix we are not able to reproduce it after ~48h
    
    There was commit cf90b74341ee ("ice: Fix call trace with null VSI during
    VF reset") which also tried to fix this issue, but it was only
    partially resolved and the bug still exists.
    
    [ 6420.658415] BUG: kernel NULL pointer dereference, address: 0000000000000000
    [ 6420.665382] #PF: supervisor read access in kernel mode
    [ 6420.670521] #PF: error_code(0x0000) - not-present page
    [ 6420.675659] PGD 0
    [ 6420.677679] Oops: 0000 [#1] PREEMPT SMP NOPTI
    [ 6420.682038] CPU: 53 PID: 326472 Comm: kworker/53:0 Kdump: loaded Not tainted 5.14.0-317.el9.x86_64 #1
    [ 6420.691250] Hardware name: Dell Inc. PowerEdge R750/04V528, BIOS 1.6.5 04/15/2022
    [ 6420.698729] Workqueue: ice ice_service_task [ice]
    [ 6420.703462] RIP: 0010:ice_vsi_is_rx_queue_active+0x2d/0x60 [ice]
    [ 6420.705860] ice 0000:ca:00.0: VF 0 is now untrusted
    [ 6420.709494] Code: 00 00 66 83 bf 76 04 00 00 00 48 8b 77 10 74 3e 31 c0 eb 0f 0f b7 97 76 04 00 00 48 83 c0 01 39 c2 7e 2b 48 8b 97 68 04 00 00 <0f> b7 0c 42 48 8b 96 20 13 00 00 48 8d 94 8a 00 00 12 00 8b 12 83
    [ 6420.714426] ice 0000:ca:00.0 ens7f0: Setting MAC 22:22:22:22:22:00 on VF 0. VF driver will be reinitialized
    [ 6420.733120] RSP: 0018:ff778d2ff383fdd8 EFLAGS: 00010246
    [ 6420.733123] RAX: 0000000000000000 RBX: ff2acf1916294000 RCX: 0000000000000000
    [ 6420.733125] RDX: 0000000000000000 RSI: ff2acf1f2c6401a0 RDI: ff2acf1a27301828
    [ 6420.762346] RBP: ff2acf1a27301828 R08: 0000000000000010 R09: 0000000000001000
    [ 6420.769476] R10: ff2acf1916286000 R11: 00000000019eba3f R12: ff2acf19066460d0
    [ 6420.776611] R13: ff2acf1f2c6401a0 R14: ff2acf1f2c6401a0 R15: 00000000ffffffff
    [ 6420.783742] FS:  0000000000000000(0000) GS:ff2acf28ffa80000(0000) knlGS:0000000000000000
    [ 6420.791829] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [ 6420.797575] CR2: 0000000000000000 CR3: 00000016ad410003 CR4: 0000000000773ee0
    [ 6420.804708] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    [ 6420.811034] vfio-pci 0000:ca:01.0: enabling device (0000 -> 0002)
    [ 6420.811840] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    [ 6420.811841] PKRU: 55555554
    [ 6420.811842] Call Trace:
    [ 6420.811843]  <TASK>
    [ 6420.811844]  ice_reset_vf+0x9a/0x450 [ice]
    [ 6420.811876]  ice_process_vflr_event+0x8f/0xc0 [ice]
    [ 6420.841343]  ice_service_task+0x23b/0x600 [ice]
    [ 6420.845884]  ? __schedule+0x212/0x550
    [ 6420.849550]  process_one_work+0x1e2/0x3b0
    [ 6420.853563]  ? rescuer_thread+0x390/0x390
    [ 6420.857577]  worker_thread+0x50/0x3a0
    [ 6420.861242]  ? rescuer_thread+0x390/0x390
    [ 6420.865253]  kthread+0xdd/0x100
    [ 6420.868400]  ? kthread_complete_and_exit+0x20/0x20
    [ 6420.873194]  ret_from_fork+0x1f/0x30
    [ 6420.876774]  </TASK>
    [ 6420.878967] Modules linked in: vfio_pci vfio_pci_core vfio_iommu_type1 vfio iavf vhost_net vhost vhost_iotlb tap tun xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_counter nf_tables bridge stp llc sctp ip6_udp_tunnel udp_tunnel nfp tls nfnetlink bluetooth mlx4_en mlx4_core rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache netfs rfkill sunrpc intel_rapl_msr intel_rapl_common i10nm_edac nfit libnvdimm ipmi_ssif x86_pkg_temp_thermal intel_powerclamp coretemp irdma kvm_intel i40e kvm iTCO_wdt dcdbas ib_uverbs irqbypass iTCO_vendor_support mgag200 mei_me ib_core dell_smbios isst_if_mmio isst_if_mbox_pci rapl i2c_algo_bit drm_shmem_helper intel_cstate drm_kms_helper syscopyarea sysfillrect isst_if_common sysimgblt intel_uncore fb_sys_fops dell_wmi_descriptor wmi_bmof intel_vsec mei i2c_i801 acpi_ipmi ipmi_si i2c_smbus ipmi_devintf intel_pch_thermal acpi_power_meter pcspk
     r
    
    Fixes: efe41860008e ("ice: Fix memory corruption in VF driver")
    Fixes: f23df5220d2b ("ice: Fix spurious interrupt during removal of trusted VF")
    Signed-off-by: Petr Oros <poros@redhat.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com>
    Reviewed-by: Jacob Keller <jacob.e.keller@intel.com>
    Tested-by: Rafal Romanowski <rafal.romanowski@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 9298928776203c2c47a1eba343fed15c00ab8eea
Author: Petr Oros <poros@redhat.com>
Date:   Fri Aug 11 10:07:01 2023 +0200

    Revert "ice: Fix ice VF reset during iavf initialization"
    
    [ Upstream commit 0ecff05e6c59dd82dbcb9706db911f7fd9f40fb8 ]
    
    This reverts commit 7255355a0636b4eff08d5e8139c77d98f151c4fc.
    
    After this commit we are not able to attach VF to VM:
    virsh attach-interface v0 hostdev --managed 0000:41:01.0 --mac 52:52:52:52:52:52
    error: Failed to attach interface
    error: Cannot set interface MAC to 52:52:52:52:52:52 for ifname enp65s0f0np0 vf 0: Resource temporarily unavailable
    
    ice_check_vf_ready_for_cfg() already contain waiting for reset.
    New condition in ice_check_vf_ready_for_reset() causing only problems.
    
    Fixes: 7255355a0636 ("ice: Fix ice VF reset during iavf initialization")
    Signed-off-by: Petr Oros <poros@redhat.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com>
    Reviewed-by: Jacob Keller <jacob.e.keller@intel.com>
    Tested-by: Rafal Romanowski <rafal.romanowski@intel.com>
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 8aa038c250420f433d375f815c1d54a6afbcc988
Author: Jesse Brandeburg <jesse.brandeburg@intel.com>
Date:   Thu Aug 10 16:51:10 2023 -0700

    ice: fix receive buffer size miscalculation
    
    [ Upstream commit 10083aef784031fa9f06c19a1b182e6fad5338d9 ]
    
    The driver is misconfiguring the hardware for some values of MTU such that
    it could use multiple descriptors to receive a packet when it could have
    simply used one.
    
    Change the driver to use a round-up instead of the result of a shift, as
    the shift can truncate the lower bits of the size, and result in the
    problem noted above. It also aligns this driver with similar code in i40e.
    
    The insidiousness of this problem is that everything works with the wrong
    size, it's just not working as well as it could, as some MTU sizes end up
    using two or more descriptors, and there is no way to tell that is
    happening without looking at ice_trace or a bus analyzer.
    
    Fixes: efc2214b6047 ("ice: Add support for XDP")
    Reviewed-by: Przemek Kitszel <przemyslaw.kitszel@intel.com>
    Signed-off-by: Jesse Brandeburg <jesse.brandeburg@intel.com>
    Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
    Tested-by: Pucha Himasekhar Reddy <himasekharx.reddy.pucha@intel.com> (A Contingent worker at Intel)
    Signed-off-by: Tony Nguyen <anthony.l.nguyen@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit abee4c8eb7785e9ccfd1c86f490655d76f209622
Author: Eric Dumazet <edumazet@google.com>
Date:   Sat Aug 19 03:17:07 2023 +0000

    ipv4: fix data-races around inet->inet_id
    
    [ Upstream commit f866fbc842de5976e41ba874b76ce31710b634b5 ]
    
    UDP sendmsg() is lockless, so ip_select_ident_segs()
    can very well be run from multiple cpus [1]
    
    Convert inet->inet_id to an atomic_t, but implement
    a dedicated path for TCP, avoiding cost of a locked
    instruction (atomic_add_return())
    
    Note that this patch will cause a trivial merge conflict
    because we added inet->flags in net-next tree.
    
    v2: added missing change in
    drivers/net/ethernet/chelsio/inline_crypto/chtls/chtls_cm.c
    (David Ahern)
    
    [1]
    
    BUG: KCSAN: data-race in __ip_make_skb / __ip_make_skb
    
    read-write to 0xffff888145af952a of 2 bytes by task 7803 on cpu 1:
    ip_select_ident_segs include/net/ip.h:542 [inline]
    ip_select_ident include/net/ip.h:556 [inline]
    __ip_make_skb+0x844/0xc70 net/ipv4/ip_output.c:1446
    ip_make_skb+0x233/0x2c0 net/ipv4/ip_output.c:1560
    udp_sendmsg+0x1199/0x1250 net/ipv4/udp.c:1260
    inet_sendmsg+0x63/0x80 net/ipv4/af_inet.c:830
    sock_sendmsg_nosec net/socket.c:725 [inline]
    sock_sendmsg net/socket.c:748 [inline]
    ____sys_sendmsg+0x37c/0x4d0 net/socket.c:2494
    ___sys_sendmsg net/socket.c:2548 [inline]
    __sys_sendmmsg+0x269/0x500 net/socket.c:2634
    __do_sys_sendmmsg net/socket.c:2663 [inline]
    __se_sys_sendmmsg net/socket.c:2660 [inline]
    __x64_sys_sendmmsg+0x57/0x60 net/socket.c:2660
    do_syscall_x64 arch/x86/entry/common.c:50 [inline]
    do_syscall_64+0x41/0xc0 arch/x86/entry/common.c:80
    entry_SYSCALL_64_after_hwframe+0x63/0xcd
    
    read to 0xffff888145af952a of 2 bytes by task 7804 on cpu 0:
    ip_select_ident_segs include/net/ip.h:541 [inline]
    ip_select_ident include/net/ip.h:556 [inline]
    __ip_make_skb+0x817/0xc70 net/ipv4/ip_output.c:1446
    ip_make_skb+0x233/0x2c0 net/ipv4/ip_output.c:1560
    udp_sendmsg+0x1199/0x1250 net/ipv4/udp.c:1260
    inet_sendmsg+0x63/0x80 net/ipv4/af_inet.c:830
    sock_sendmsg_nosec net/socket.c:725 [inline]
    sock_sendmsg net/socket.c:748 [inline]
    ____sys_sendmsg+0x37c/0x4d0 net/socket.c:2494
    ___sys_sendmsg net/socket.c:2548 [inline]
    __sys_sendmmsg+0x269/0x500 net/socket.c:2634
    __do_sys_sendmmsg net/socket.c:2663 [inline]
    __se_sys_sendmmsg net/socket.c:2660 [inline]
    __x64_sys_sendmmsg+0x57/0x60 net/socket.c:2660
    do_syscall_x64 arch/x86/entry/common.c:50 [inline]
    do_syscall_64+0x41/0xc0 arch/x86/entry/common.c:80
    entry_SYSCALL_64_after_hwframe+0x63/0xcd
    
    value changed: 0x184d -> 0x184e
    
    Reported by Kernel Concurrency Sanitizer on:
    CPU: 0 PID: 7804 Comm: syz-executor.1 Not tainted 6.5.0-rc6-syzkaller #0
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 07/26/2023
    ==================================================================
    
    Fixes: 23f57406b82d ("ipv4: avoid using shared IP generator for connected sockets")
    Reported-by: syzbot <syzkaller@googlegroups.com>
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Reviewed-by: David Ahern <dsahern@kernel.org>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 3844e0c559772857d9fa17e1e2ba9d0793f721f0
Author: Jakub Kicinski <kuba@kernel.org>
Date:   Fri Aug 18 18:26:02 2023 -0700

    net: validate veth and vxcan peer ifindexes
    
    [ Upstream commit f534f6581ec084fe94d6759f7672bd009794b07e ]
    
    veth and vxcan need to make sure the ifindexes of the peer
    are not negative, core does not validate this.
    
    Using iproute2 with user-space-level checking removed:
    
    Before:
    
      # ./ip link add index 10 type veth peer index -1
      # ip link show
      1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN mode DEFAULT group default qlen 1000
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
      2: enp1s0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UP mode DEFAULT group default qlen 1000
        link/ether 52:54:00:74:b2:03 brd ff:ff:ff:ff:ff:ff
      10: veth1@veth0: <BROADCAST,MULTICAST,M-DOWN> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000
        link/ether 8a:90:ff:57:6d:5d brd ff:ff:ff:ff:ff:ff
      -1: veth0@veth1: <BROADCAST,MULTICAST,M-DOWN> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000
        link/ether ae:ed:18:e6:fa:7f brd ff:ff:ff:ff:ff:ff
    
    Now:
    
      $ ./ip link add index 10 type veth peer index -1
      Error: ifindex can't be negative.
    
    This problem surfaced in net-next because an explicit WARN()
    was added, the root cause is older.
    
    Fixes: e6f8f1a739b6 ("veth: Allow to create peer link with given ifindex")
    Fixes: a8f820a380a2 ("can: add Virtual CAN Tunnel driver (vxcan)")
    Reported-by: syzbot+5ba06978f34abb058571@syzkaller.appspotmail.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Reviewed-by: Eric Dumazet <edumazet@google.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 691799211bf115e946ada9f2f38ab8939c72996a
Author: Ruan Jinjie <ruanjinjie@huawei.com>
Date:   Fri Aug 18 13:12:21 2023 +0800

    net: bcmgenet: Fix return value check for fixed_phy_register()
    
    [ Upstream commit 32bbe64a1386065ab2aef8ce8cae7c689d0add6e ]
    
    The fixed_phy_register() function returns error pointers and never
    returns NULL. Update the checks accordingly.
    
    Fixes: b0ba512e25d7 ("net: bcmgenet: enable driver to work without a device tree")
    Signed-off-by: Ruan Jinjie <ruanjinjie@huawei.com>
    Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
    Acked-by: Doug Berger <opendmb@gmail.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit d3a74a85fbb42c774ae75390242a10c34dcee01c
Author: Ruan Jinjie <ruanjinjie@huawei.com>
Date:   Fri Aug 18 13:12:20 2023 +0800

    net: bgmac: Fix return value check for fixed_phy_register()
    
    [ Upstream commit 23a14488ea5882dea5851b65c9fce2127ee8fcad ]
    
    The fixed_phy_register() function returns error pointers and never
    returns NULL. Update the checks accordingly.
    
    Fixes: c25b23b8a387 ("bgmac: register fixed PHY for ARM BCM470X / BCM5301X chipsets")
    Signed-off-by: Ruan Jinjie <ruanjinjie@huawei.com>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit a7cecd332c9ed376854c9fadf549619d32f21664
Author: Serge Semin <fancer.lancer@gmail.com>
Date:   Wed Aug 16 21:06:52 2023 +0300

    net: mdio: mdio-bitbang: Fix C45 read/write protocol
    
    [ Upstream commit 2572ce62415cf3b632391091447252e2661ed520 ]
    
    Based on the original code semantic in case of Clause 45 MDIO, the address
    command is supposed to be followed by the command sending the MMD address,
    not the CSR address. The commit 002dd3de097c ("net: mdio: mdio-bitbang:
    Separate C22 and C45 transactions") has erroneously broken that. So most
    likely due to an unfortunate variable name it switched the code to sending
    the CSR address. In our case it caused the protocol malfunction so the
    read operation always failed with the turnaround bit always been driven to
    one by PHY instead of zero. Fix that by getting back the correct
    behaviour: sending MMD address command right after the regular address
    command.
    
    Fixes: 002dd3de097c ("net: mdio: mdio-bitbang: Separate C22 and C45 transactions")
    Signed-off-by: Serge Semin <fancer.lancer@gmail.com>
    Reviewed-by: Andrew Lunn <andrew@lunn.ch>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 7e7b2b50dcd969db8dccd2a653b4322dc8cd6e4e
Author: Arınç ÜNAL <arinc.unal@arinc9.com>
Date:   Sun Aug 13 13:59:17 2023 +0300

    net: dsa: mt7530: fix handling of 802.1X PAE frames
    
    [ Upstream commit e94b590abfff2cdbf0bdaa7d9904364c8d480af5 ]
    
    802.1X PAE frames are link-local frames, therefore they must be trapped to
    the CPU port. Currently, the MT753X switches treat 802.1X PAE frames as
    regular multicast frames, therefore flooding them to user ports. To fix
    this, set 802.1X PAE frames to be trapped to the CPU port(s).
    
    Fixes: b8f126a8d543 ("net-next: dsa: add dsa support for Mediatek MT7530 switch")
    Signed-off-by: Arınç ÜNAL <arinc.unal@arinc9.com>
    Reviewed-by: Vladimir Oltean <olteanv@gmail.com>
    Signed-off-by: David S. Miller <davem@davemloft.net>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit b457f312e78e9199a1be5363b76e792181c09f91
Author: Ido Schimmel <idosch@nvidia.com>
Date:   Thu Aug 17 15:58:25 2023 +0200

    selftests: mlxsw: Fix test failure on Spectrum-4
    
    [ Upstream commit f520489e99a35b0a5257667274fbe9afd2d8c50b ]
    
    Remove assumptions about shared buffer cell size and instead query the
    cell size from devlink. Adjust the test to send small packets that fit
    inside a single cell.
    
    Tested on Spectrum-{1,2,3,4}.
    
    Fixes: 4735402173e6 ("mlxsw: spectrum: Extend to support Spectrum-4 ASIC")
    Signed-off-by: Ido Schimmel <idosch@nvidia.com>
    Reviewed-by: Petr Machata <petrm@nvidia.com>
    Signed-off-by: Petr Machata <petrm@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/f7dfbf3c4d1cb23838d9eb99bab09afaa320c4ca.1692268427.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 747e71ff06bfd9cdf13ff990773481ab5e461ef3
Author: Amit Cohen <amcohen@nvidia.com>
Date:   Thu Aug 17 15:58:24 2023 +0200

    mlxsw: Fix the size of 'VIRT_ROUTER_MSB'
    
    [ Upstream commit 348c976be0a599918b88729def198a843701c9fe ]
    
    The field 'virtual router' was extended to 12 bits in Spectrum-4.
    Therefore, the element 'MLXSW_AFK_ELEMENT_VIRT_ROUTER_MSB' needs 3 bits for
    Spectrum < 4 and 4 bits for Spectrum >= 4.
    
    The elements are stored in an internal storage scratchpad. Currently, the
    MSB is defined there as 3 bits. It means that for Spectrum-4, only 2K VRFs
    can be used for multicast routing, as the highest bit is not really used by
    the driver. Fix the definition of 'VIRT_ROUTER_MSB' to use 4 bits. Adjust
    the definitions of 'virtual router' field in the blocks accordingly - use
    '_avoid_size_check' for Spectrum-2 instead of for Spectrum-4. Fix the mask
    in parse function to use 4 bits.
    
    Fixes: 6d5d8ebb881c ("mlxsw: Rename virtual router flex key element")
    Signed-off-by: Amit Cohen <amcohen@nvidia.com>
    Reviewed-by: Ido Schimmel <idosch@nvidia.com>
    Signed-off-by: Petr Machata <petrm@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/79bed2b70f6b9ed58d4df02e9798a23da648015b.1692268427.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 5a76c5256501a99d4dc15fa7c69fe2cef852765c
Author: Ido Schimmel <idosch@nvidia.com>
Date:   Thu Aug 17 15:58:23 2023 +0200

    mlxsw: reg: Fix SSPR register layout
    
    [ Upstream commit 0dc63b9cfd4c5666ced52c829fdd65dcaeb9f0f1 ]
    
    The two most significant bits of the "local_port" field in the SSPR
    register are always cleared since they are overwritten by the deprecated
    and overlapping "sub_port" field.
    
    On systems with more than 255 local ports (e.g., Spectrum-4), this
    results in the firmware maintaining invalid mappings between system port
    and local port. Specifically, two different systems ports (0x1 and
    0x101) point to the same local port (0x1), which eventually leads to
    firmware errors.
    
    Fix by removing the deprecated "sub_port" field.
    
    Fixes: fd24b29a1b74 ("mlxsw: reg: Align existing registers to use extended local_port field")
    Signed-off-by: Ido Schimmel <idosch@nvidia.com>
    Signed-off-by: Petr Machata <petrm@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/9b909a3033c8d3d6f67f237306bef4411c5e6ae4.1692268427.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 40ffbae5312a0c5aabaf380cd28ebfe5d452eb86
Author: Danielle Ratson <danieller@nvidia.com>
Date:   Thu Aug 17 15:58:22 2023 +0200

    mlxsw: pci: Set time stamp fields also when its type is MIRROR_UTC
    
    [ Upstream commit bc2de151ab6ad0762a04563527ec42e54dde572a ]
    
    Currently, in Spectrum-2 and above, time stamps are extracted from the CQE
    into the time stamp fields in 'struct mlxsw_skb_cb', only when the CQE
    time stamp type is UTC. The time stamps are read directly from the CQE and
    software can get the time stamp in UTC format using CQEv2.
    
    From Spectrum-4, the time stamps that are read from the CQE are allowed
    to be also from MIRROR_UTC type.
    
    Therefore, we get a warning [1] from the driver that the time stamp fields
    were not set, when LLDP control packet is sent.
    
    Allow the time stamp type to be MIRROR_UTC and set the time stamp in this
    case as well.
    
    [1]
     WARNING: CPU: 11 PID: 0 at drivers/net/ethernet/mellanox/mlxsw/spectrum_ptp.c:1409 mlxsw_sp2_ptp_hwtstamp_fill+0x1f/0x70 [mlxsw_spectrum]
    [...]
     Call Trace:
      <IRQ>
      mlxsw_sp2_ptp_receive+0x3c/0x80 [mlxsw_spectrum]
      mlxsw_core_skb_receive+0x119/0x190 [mlxsw_core]
      mlxsw_pci_cq_tasklet+0x3c9/0x780 [mlxsw_pci]
      tasklet_action_common.constprop.0+0x9f/0x110
      __do_softirq+0xbb/0x296
      irq_exit_rcu+0x79/0xa0
      common_interrupt+0x86/0xa0
      </IRQ>
      <TASK>
    
    Fixes: 4735402173e6 ("mlxsw: spectrum: Extend to support Spectrum-4 ASIC")
    Signed-off-by: Danielle Ratson <danieller@nvidia.com>
    Reviewed-by: Ido Schimmel <idosch@nvidia.com>
    Signed-off-by: Petr Machata <petrm@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/bcef4d044ef608a4e258d33a7ec0ecd91f480db5.1692268427.git.petrm@nvidia.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 3f5a3e0274107056a4891428f80b2c99c98f5695
Author: Lu Wei <luwei32@huawei.com>
Date:   Thu Aug 17 22:54:49 2023 +0800

    ipvlan: Fix a reference count leak warning in ipvlan_ns_exit()
    
    [ Upstream commit 043d5f68d0ccdda91029b4b6dce7eeffdcfad281 ]
    
    There are two network devices(veth1 and veth3) in ns1, and ipvlan1 with
    L3S mode and ipvlan2 with L2 mode are created based on them as
    figure (1). In this case, ipvlan_register_nf_hook() will be called to
    register nf hook which is needed by ipvlans in L3S mode in ns1 and value
    of ipvl_nf_hook_refcnt is set to 1.
    
    (1)
               ns1                           ns2
          ------------                  ------------
    
       veth1--ipvlan1 (L3S)
    
       veth3--ipvlan2 (L2)
    
    (2)
               ns1                           ns2
          ------------                  ------------
    
       veth1--ipvlan1 (L3S)
    
             ipvlan2 (L2)                  veth3
         |                                  |
         |------->-------->--------->--------
                        migrate
    
    When veth3 migrates from ns1 to ns2 as figure (2), veth3 will register in
    ns2 and calls call_netdevice_notifiers with NETDEV_REGISTER event:
    
    dev_change_net_namespace
        call_netdevice_notifiers
            ipvlan_device_event
                ipvlan_migrate_l3s_hook
                    ipvlan_register_nf_hook(newnet)      (I)
                    ipvlan_unregister_nf_hook(oldnet)    (II)
    
    In function ipvlan_migrate_l3s_hook(), ipvl_nf_hook_refcnt in ns1 is not 0
    since veth1 with ipvlan1 still in ns1, (I) and (II) will be called to
    register nf_hook in ns2 and unregister nf_hook in ns1. As a result,
    ipvl_nf_hook_refcnt in ns1 is decreased incorrectly and this in ns2
    is increased incorrectly. When the second net namespace is removed, a
    reference count leak warning in ipvlan_ns_exit() will be triggered.
    
    This patch add a check before ipvlan_migrate_l3s_hook() is called. The
    warning can be triggered as follows:
    
    $ ip netns add ns1
    $ ip netns add ns2
    $ ip netns exec ns1 ip link add veth1 type veth peer name veth2
    $ ip netns exec ns1 ip link add veth3 type veth peer name veth4
    $ ip netns exec ns1 ip link add ipv1 link veth1 type ipvlan mode l3s
    $ ip netns exec ns1 ip link add ipv2 link veth3 type ipvlan mode l2
    $ ip netns exec ns1 ip link set veth3 netns ns2
    $ ip net del ns2
    
    Fixes: 3133822f5ac1 ("ipvlan: use pernet operations and restrict l3s hooks to master netns")
    Signed-off-by: Lu Wei <luwei32@huawei.com>
    Reviewed-by: Florian Westphal <fw@strlen.de>
    Link: https://lore.kernel.org/r/20230817145449.141827-1-luwei32@huawei.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 056e0ce1f1c0b8da388e5f7de16afab56c4ff95f
Author: Eric Dumazet <edumazet@google.com>
Date:   Fri Aug 18 01:58:20 2023 +0000

    dccp: annotate data-races in dccp_poll()
    
    [ Upstream commit cba3f1786916063261e3e5ccbb803abc325b24ef ]
    
    We changed tcp_poll() over time, bug never updated dccp.
    
    Note that we also could remove dccp instead of maintaining it.
    
    Fixes: 7c657876b63c ("[DCCP]: Initial implementation")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Link: https://lore.kernel.org/r/20230818015820.2701595-1-edumazet@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 2a7d2f2b8c2caaa326b6551ab287fba32319ab55
Author: Eric Dumazet <edumazet@google.com>
Date:   Fri Aug 18 01:51:32 2023 +0000

    sock: annotate data-races around prot->memory_pressure
    
    [ Upstream commit 76f33296d2e09f63118db78125c95ef56df438e9 ]
    
    *prot->memory_pressure is read/writen locklessly, we need
    to add proper annotations.
    
    A recent commit added a new race, it is time to audit all accesses.
    
    Fixes: 2d0c88e84e48 ("sock: Fix misuse of sk_under_memory_pressure()")
    Fixes: 4d93df0abd50 ("[SCTP]: Rewrite of sctp buffer management code")
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Abel Wu <wuyun.abel@bytedance.com>
    Reviewed-by: Shakeel Butt <shakeelb@google.com>
    Link: https://lore.kernel.org/r/20230818015132.2699348-1-edumazet@google.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit b8bcc45afcd31f4bf3ab1d5a95d6256be9a686e8
Author: Vladimir Oltean <vladimir.oltean@nxp.com>
Date:   Thu Aug 17 15:01:11 2023 +0300

    net: dsa: felix: fix oversize frame dropping for always closed tc-taprio gates
    
    [ Upstream commit d44036cad31170da0cb9c728e80743f84267da6e ]
    
    The blamed commit resolved a bug where frames would still get stuck at
    egress, even though they're smaller than the maxSDU[tc], because the
    driver did not take into account the extra 33 ns that the queue system
    needs for scheduling the frame.
    
    It now takes that into account, but the arithmetic that we perform in
    vsc9959_tas_remaining_gate_len_ps() is buggy, because we operate on
    64-bit unsigned integers, so gate_len_ns - VSC9959_TAS_MIN_GATE_LEN_NS
    may become a very large integer if gate_len_ns < 33 ns.
    
    In practice, this means that we've introduced a regression where all
    traffic class gates which are permanently closed will not get detected
    by the driver, and we won't enable oversize frame dropping for them.
    
    Before:
    mscc_felix 0000:00:00.5: port 0: max frame size 1526 needs 12400000 ps, 1152000 ps for mPackets at speed 1000
    mscc_felix 0000:00:00.5: port 0 tc 0 min gate len 1000000, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 1 min gate len 0, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 2 min gate len 0, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 3 min gate len 0, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 4 min gate len 0, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 5 min gate len 0, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 6 min gate len 0, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 7 min gate length 5120 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 615 octets including FCS
    
    After:
    mscc_felix 0000:00:00.5: port 0: max frame size 1526 needs 12400000 ps, 1152000 ps for mPackets at speed 1000
    mscc_felix 0000:00:00.5: port 0 tc 0 min gate len 1000000, sending all frames
    mscc_felix 0000:00:00.5: port 0 tc 1 min gate length 0 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 1 octets including FCS
    mscc_felix 0000:00:00.5: port 0 tc 2 min gate length 0 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 1 octets including FCS
    mscc_felix 0000:00:00.5: port 0 tc 3 min gate length 0 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 1 octets including FCS
    mscc_felix 0000:00:00.5: port 0 tc 4 min gate length 0 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 1 octets including FCS
    mscc_felix 0000:00:00.5: port 0 tc 5 min gate length 0 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 1 octets including FCS
    mscc_felix 0000:00:00.5: port 0 tc 6 min gate length 0 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 1 octets including FCS
    mscc_felix 0000:00:00.5: port 0 tc 7 min gate length 5120 ns not enough for max frame size 1526 at 1000 Mbps, dropping frames over 615 octets including FCS
    
    Fixes: 11afdc6526de ("net: dsa: felix: tc-taprio intervals smaller than MTU should send at least one packet")
    Signed-off-by: Vladimir Oltean <vladimir.oltean@nxp.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/20230817120111.3522827-1-vladimir.oltean@nxp.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit e3b4e5276ccd9b6c586d559f2c7a7f1a850cd3de
Author: Jiri Pirko <jiri@resnulli.us>
Date:   Thu Aug 17 14:52:40 2023 +0200

    devlink: add missing unregister linecard notification
    
    [ Upstream commit 2ebbc9752d06bb1d01201fe632cb6da033b0248d ]
    
    Cited fixes commit introduced linecard notifications for register,
    however it didn't add them for unregister. Fix that by adding them.
    
    Fixes: c246f9b5fd61 ("devlink: add support to create line card and expose to user")
    Signed-off-by: Jiri Pirko <jiri@nvidia.com>
    Reviewed-by: Simon Horman <horms@kernel.org>
    Link: https://lore.kernel.org/r/20230817125240.2144794-1-jiri@resnulli.us
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 0f0dd7b19ec69cc8b8dfe04a0ffc2f00912b336d
Author: Hariprasad Kelam <hkelam@marvell.com>
Date:   Thu Aug 17 12:00:06 2023 +0530

    octeontx2-af: SDP: fix receive link config
    
    [ Upstream commit 05f3d5bc23524bed6f043dfe6b44da687584f9fb ]
    
    On SDP interfaces, frame oversize and undersize errors are
    observed as driver is not considering packet sizes of all
    subscribers of the link before updating the link config.
    
    This patch fixes the same.
    
    Fixes: 9b7dd87ac071 ("octeontx2-af: Support to modify min/max allowed packet lengths")
    Signed-off-by: Hariprasad Kelam <hkelam@marvell.com>
    Signed-off-by: Sunil Goutham <sgoutham@marvell.com>
    Reviewed-by: Leon Romanovsky <leonro@nvidia.com>
    Link: https://lore.kernel.org/r/20230817063006.10366-1-hkelam@marvell.com
    Signed-off-by: Jakub Kicinski <kuba@kernel.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 2242640e9bd94e706acf75c60a2ab1d0e150e0fb
Author: Zheng Yejian <zhengyejian1@huawei.com>
Date:   Thu Aug 17 20:55:39 2023 +0800

    tracing: Fix memleak due to race between current_tracer and trace
    
    [ Upstream commit eecb91b9f98d6427d4af5fdb8f108f52572a39e7 ]
    
    Kmemleak report a leak in graph_trace_open():
    
      unreferenced object 0xffff0040b95f4a00 (size 128):
        comm "cat", pid 204981, jiffies 4301155872 (age 99771.964s)
        hex dump (first 32 bytes):
          e0 05 e7 b4 ab 7d 00 00 0b 00 01 00 00 00 00 00 .....}..........
          f4 00 01 10 00 a0 ff ff 00 00 00 00 65 00 10 00 ............e...
        backtrace:
          [<000000005db27c8b>] kmem_cache_alloc_trace+0x348/0x5f0
          [<000000007df90faa>] graph_trace_open+0xb0/0x344
          [<00000000737524cd>] __tracing_open+0x450/0xb10
          [<0000000098043327>] tracing_open+0x1a0/0x2a0
          [<00000000291c3876>] do_dentry_open+0x3c0/0xdc0
          [<000000004015bcd6>] vfs_open+0x98/0xd0
          [<000000002b5f60c9>] do_open+0x520/0x8d0
          [<00000000376c7820>] path_openat+0x1c0/0x3e0
          [<00000000336a54b5>] do_filp_open+0x14c/0x324
          [<000000002802df13>] do_sys_openat2+0x2c4/0x530
          [<0000000094eea458>] __arm64_sys_openat+0x130/0x1c4
          [<00000000a71d7881>] el0_svc_common.constprop.0+0xfc/0x394
          [<00000000313647bf>] do_el0_svc+0xac/0xec
          [<000000002ef1c651>] el0_svc+0x20/0x30
          [<000000002fd4692a>] el0_sync_handler+0xb0/0xb4
          [<000000000c309c35>] el0_sync+0x160/0x180
    
    The root cause is descripted as follows:
    
      __tracing_open() {  // 1. File 'trace' is being opened;
        ...
        *iter->trace = *tr->current_trace;  // 2. Tracer 'function_graph' is
                                            //    currently set;
        ...
        iter->trace->open(iter);  // 3. Call graph_trace_open() here,
                                  //    and memory are allocated in it;
        ...
      }
    
      s_start() {  // 4. The opened file is being read;
        ...
        *iter->trace = *tr->current_trace;  // 5. If tracer is switched to
                                            //    'nop' or others, then memory
                                            //    in step 3 are leaked!!!
        ...
      }
    
    To fix it, in s_start(), close tracer before switching then reopen the
    new tracer after switching. And some tracers like 'wakeup' may not update
    'iter->private' in some cases when reopen, then it should be cleared
    to avoid being mistakenly closed again.
    
    Link: https://lore.kernel.org/linux-trace-kernel/20230817125539.1646321-1-zhengyejian1@huawei.com
    
    Fixes: d7350c3f4569 ("tracing/core: make the read callbacks reentrants")
    Signed-off-by: Zheng Yejian <zhengyejian1@huawei.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 49834a2c43d5543631b576345f743c587f5242f3
Author: Sven Schnelle <svens@linux.ibm.com>
Date:   Wed Aug 16 17:49:28 2023 +0200

    tracing/synthetic: Allocate one additional element for size
    
    [ Upstream commit c4d6b5438116c184027b2e911c0f2c7c406fb47c ]
    
    While debugging another issue I noticed that the stack trace contains one
    invalid entry at the end:
    
    <idle>-0       [008] d..4.    26.484201: wake_lat: pid=0 delta=2629976084 000000009cc24024 stack=STACK:
    => __schedule+0xac6/0x1a98
    => schedule+0x126/0x2c0
    => schedule_timeout+0x150/0x2c0
    => kcompactd+0x9ca/0xc20
    => kthread+0x2f6/0x3d8
    => __ret_from_fork+0x8a/0xe8
    => 0x6b6b6b6b6b6b6b6b
    
    This is because the code failed to add the one element containing the
    number of entries to field_size.
    
    Link: https://lkml.kernel.org/r/20230816154928.4171614-4-svens@linux.ibm.com
    
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Fixes: 00cf3d672a9d ("tracing: Allow synthetic events to pass around stacktraces")
    Signed-off-by: Sven Schnelle <svens@linux.ibm.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 009e77a9169082c7c9813d4994530f020326b1e1
Author: Sven Schnelle <svens@linux.ibm.com>
Date:   Wed Aug 16 17:49:27 2023 +0200

    tracing/synthetic: Skip first entry for stack traces
    
    [ Upstream commit 887f92e09ef34a949745ad26ce82be69e2dabcf6 ]
    
    While debugging another issue I noticed that the stack trace output
    contains the number of entries on top:
    
             <idle>-0       [000] d..4.   203.322502: wake_lat: pid=0 delta=2268270616 stack=STACK:
    => 0x10
    => __schedule+0xac6/0x1a98
    => schedule+0x126/0x2c0
    => schedule_timeout+0x242/0x2c0
    => __wait_for_common+0x434/0x680
    => __wait_rcu_gp+0x198/0x3e0
    => synchronize_rcu+0x112/0x138
    => ring_buffer_reset_online_cpus+0x140/0x2e0
    => tracing_reset_online_cpus+0x15c/0x1d0
    => tracing_set_clock+0x180/0x1d8
    => hist_register_trigger+0x486/0x670
    => event_hist_trigger_parse+0x494/0x1318
    => trigger_process_regex+0x1d4/0x258
    => event_trigger_write+0xb4/0x170
    => vfs_write+0x210/0xad0
    => ksys_write+0x122/0x208
    
    Fix this by skipping the first element. Also replace the pointer
    logic with an index variable which is easier to read.
    
    Link: https://lkml.kernel.org/r/20230816154928.4171614-3-svens@linux.ibm.com
    
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Fixes: 00cf3d672a9d ("tracing: Allow synthetic events to pass around stacktraces")
    Signed-off-by: Sven Schnelle <svens@linux.ibm.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 5c2d886ea8cd1b893cd00138688db6245abe76e3
Author: Sven Schnelle <svens@linux.ibm.com>
Date:   Wed Aug 16 17:49:26 2023 +0200

    tracing/synthetic: Use union instead of casts
    
    [ Upstream commit ddeea494a16f32522bce16ee65f191d05d4b8282 ]
    
    The current code uses a lot of casts to access the fields member in struct
    synth_trace_events with different sizes.  This makes the code hard to
    read, and had already introduced an endianness bug. Use a union and struct
    instead.
    
    Link: https://lkml.kernel.org/r/20230816154928.4171614-2-svens@linux.ibm.com
    
    Cc: stable@vger.kernel.org
    Cc: Masami Hiramatsu <mhiramat@kernel.org>
    Fixes: 00cf3d672a9dd ("tracing: Allow synthetic events to pass around stacktraces")
    Signed-off-by: Sven Schnelle <svens@linux.ibm.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Stable-dep-of: 887f92e09ef3 ("tracing/synthetic: Skip first entry for stack traces")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 299e0033f1bd5320279b2db971914374499f7e5f
Author: Zheng Yejian <zhengyejian1@huawei.com>
Date:   Sat Aug 5 11:38:15 2023 +0800

    tracing: Fix cpu buffers unavailable due to 'record_disabled' missed
    
    [ Upstream commit b71645d6af10196c46cbe3732de2ea7d36b3ff6d ]
    
    Trace ring buffer can no longer record anything after executing
    following commands at the shell prompt:
    
      # cd /sys/kernel/tracing
      # cat tracing_cpumask
      fff
      # echo 0 > tracing_cpumask
      # echo 1 > snapshot
      # echo fff > tracing_cpumask
      # echo 1 > tracing_on
      # echo "hello world" > trace_marker
      -bash: echo: write error: Bad file descriptor
    
    The root cause is that:
      1. After `echo 0 > tracing_cpumask`, 'record_disabled' of cpu buffers
         in 'tr->array_buffer.buffer' became 1 (see tracing_set_cpumask());
      2. After `echo 1 > snapshot`, 'tr->array_buffer.buffer' is swapped
         with 'tr->max_buffer.buffer', then the 'record_disabled' became 0
         (see update_max_tr());
      3. After `echo fff > tracing_cpumask`, the 'record_disabled' become -1;
    Then array_buffer and max_buffer are both unavailable due to value of
    'record_disabled' is not 0.
    
    To fix it, enable or disable both array_buffer and max_buffer at the same
    time in tracing_set_cpumask().
    
    Link: https://lkml.kernel.org/r/20230805033816.3284594-2-zhengyejian1@huawei.com
    
    Cc: <mhiramat@kernel.org>
    Cc: <vnagarnaik@google.com>
    Cc: <shuah@kernel.org>
    Fixes: 71babb2705e2 ("tracing: change CPU ring buffer state from tracing_cpumask")
    Signed-off-by: Zheng Yejian <zhengyejian1@huawei.com>
    Signed-off-by: Steven Rostedt (Google) <rostedt@goodmis.org>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit f3acc61309e002fa91bdf60d37deb33aedf3dc84
Author: Randy Dunlap <rdunlap@infradead.org>
Date:   Fri Aug 11 22:29:47 2023 -0700

    wifi: iwlwifi: mvm: add dependency for PTP clock
    
    [ Upstream commit 609a1bcd7bebac90a1b443e9fed47fd48dac5799 ]
    
    When the code to use the PTP HW clock was added, it didn't update
    the Kconfig entry for the PTP dependency, leading to build errors,
    so update the Kconfig entry to depend on PTP_1588_CLOCK_OPTIONAL.
    
    aarch64-linux-ld: drivers/net/wireless/intel/iwlwifi/mvm/ptp.o: in function `iwl_mvm_ptp_init':
    drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:294: undefined reference to `ptp_clock_register'
    drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:294:(.text+0xce8): relocation truncated to fit: R_AARCH64_CALL26 against undefined symbol `ptp_clock_register'
    aarch64-linux-ld: drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:301: undefined reference to `ptp_clock_index'
    drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:301:(.text+0xd18): relocation truncated to fit: R_AARCH64_CALL26 against undefined symbol `ptp_clock_index'
    aarch64-linux-ld: drivers/net/wireless/intel/iwlwifi/mvm/ptp.o: in function `iwl_mvm_ptp_remove':
    drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:315: undefined reference to `ptp_clock_index'
    drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:315:(.text+0xe80): relocation truncated to fit: R_AARCH64_CALL26 against undefined symbol `ptp_clock_index'
    aarch64-linux-ld: drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:319: undefined reference to `ptp_clock_unregister'
    drivers/net/wireless/intel/iwlwifi/mvm/ptp.c:319:(.text+0xeac): relocation truncated to fit: R_AARCH64_CALL26 against undefined symbol `ptp_clock_unregister'
    
    Fixes: 1595ecce1cf3 ("wifi: iwlwifi: mvm: add support for PTP HW clock (PHC)")
    Signed-off-by: Randy Dunlap <rdunlap@infradead.org>
    Reported-by: kernel test robot <lkp@intel.com>
    Link: https://lore.kernel.org/all/202308110447.4QSJHmFH-lkp@intel.com/
    Cc: Krishnanand Prabhu <krishnanand.prabhu@intel.com>
    Cc: Luca Coelho <luciano.coelho@intel.com>
    Cc: Gregory Greenman <gregory.greenman@intel.com>
    Cc: Johannes Berg <johannes.berg@intel.com>
    Cc: Kalle Valo <kvalo@kernel.org>
    Cc: linux-wireless@vger.kernel.org
    Cc: "David S. Miller" <davem@davemloft.net>
    Cc: Eric Dumazet <edumazet@google.com>
    Cc: Jakub Kicinski <kuba@kernel.org>
    Cc: Paolo Abeni <pabeni@redhat.com>
    Cc: netdev@vger.kernel.org
    Reviewed-by: Simon Horman <horms@kernel.org>
    Tested-by: Simon Horman <horms@kernel.org> # build-tested
    Acked-by: Richard Cochran <richardcochran@gmail.com>
    Acked-by: Gregory Greenman <gregory.greenman@intel.com>
    Link: https://lore.kernel.org/r/20230812052947.22913-1-rdunlap@infradead.org
    Signed-off-by: Johannes Berg <johannes.berg@intel.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 7f35e56117309edd135d58a5d2c9e309360efd61
Author: Eric Dumazet <edumazet@google.com>
Date:   Thu Jul 20 11:44:38 2023 +0000

    can: raw: fix lockdep issue in raw_release()
    
    [ Upstream commit 11c9027c983e9e4b408ee5613b6504d24ebd85be ]
    
    syzbot complained about a lockdep issue [1]
    
    Since raw_bind() and raw_setsockopt() first get RTNL
    before locking the socket, we must adopt the same order in raw_release()
    
    [1]
    WARNING: possible circular locking dependency detected
    6.5.0-rc1-syzkaller-00192-g78adb4bcf99e #0 Not tainted
    ------------------------------------------------------
    syz-executor.0/14110 is trying to acquire lock:
    ffff88804e4b6130 (sk_lock-AF_CAN){+.+.}-{0:0}, at: lock_sock include/net/sock.h:1708 [inline]
    ffff88804e4b6130 (sk_lock-AF_CAN){+.+.}-{0:0}, at: raw_bind+0xb1/0xab0 net/can/raw.c:435
    
    but task is already holding lock:
    ffffffff8e3df368 (rtnl_mutex){+.+.}-{3:3}, at: raw_bind+0xa7/0xab0 net/can/raw.c:434
    
    which lock already depends on the new lock.
    
    the existing dependency chain (in reverse order) is:
    
    -> #1 (rtnl_mutex){+.+.}-{3:3}:
    __mutex_lock_common kernel/locking/mutex.c:603 [inline]
    __mutex_lock+0x181/0x1340 kernel/locking/mutex.c:747
    raw_release+0x1c6/0x9b0 net/can/raw.c:391
    __sock_release+0xcd/0x290 net/socket.c:654
    sock_close+0x1c/0x20 net/socket.c:1386
    __fput+0x3fd/0xac0 fs/file_table.c:384
    task_work_run+0x14d/0x240 kernel/task_work.c:179
    resume_user_mode_work include/linux/resume_user_mode.h:49 [inline]
    exit_to_user_mode_loop kernel/entry/common.c:171 [inline]
    exit_to_user_mode_prepare+0x210/0x240 kernel/entry/common.c:204
    __syscall_exit_to_user_mode_work kernel/entry/common.c:286 [inline]
    syscall_exit_to_user_mode+0x1d/0x50 kernel/entry/common.c:297
    do_syscall_64+0x44/0xb0 arch/x86/entry/common.c:86
    entry_SYSCALL_64_after_hwframe+0x63/0xcd
    
    -> #0 (sk_lock-AF_CAN){+.+.}-{0:0}:
    check_prev_add kernel/locking/lockdep.c:3142 [inline]
    check_prevs_add kernel/locking/lockdep.c:3261 [inline]
    validate_chain kernel/locking/lockdep.c:3876 [inline]
    __lock_acquire+0x2e3d/0x5de0 kernel/locking/lockdep.c:5144
    lock_acquire kernel/locking/lockdep.c:5761 [inline]
    lock_acquire+0x1ae/0x510 kernel/locking/lockdep.c:5726
    lock_sock_nested+0x3a/0xf0 net/core/sock.c:3492
    lock_sock include/net/sock.h:1708 [inline]
    raw_bind+0xb1/0xab0 net/can/raw.c:435
    __sys_bind+0x1ec/0x220 net/socket.c:1792
    __do_sys_bind net/socket.c:1803 [inline]
    __se_sys_bind net/socket.c:1801 [inline]
    __x64_sys_bind+0x72/0xb0 net/socket.c:1801
    do_syscall_x64 arch/x86/entry/common.c:50 [inline]
    do_syscall_64+0x38/0xb0 arch/x86/entry/common.c:80
    entry_SYSCALL_64_after_hwframe+0x63/0xcd
    
    other info that might help us debug this:
    
    Possible unsafe locking scenario:
    
    CPU0 CPU1
    ---- ----
    lock(rtnl_mutex);
            lock(sk_lock-AF_CAN);
            lock(rtnl_mutex);
    lock(sk_lock-AF_CAN);
    
    *** DEADLOCK ***
    
    1 lock held by syz-executor.0/14110:
    
    stack backtrace:
    CPU: 0 PID: 14110 Comm: syz-executor.0 Not tainted 6.5.0-rc1-syzkaller-00192-g78adb4bcf99e #0
    Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 07/03/2023
    Call Trace:
    <TASK>
    __dump_stack lib/dump_stack.c:88 [inline]
    dump_stack_lvl+0xd9/0x1b0 lib/dump_stack.c:106
    check_noncircular+0x311/0x3f0 kernel/locking/lockdep.c:2195
    check_prev_add kernel/locking/lockdep.c:3142 [inline]
    check_prevs_add kernel/locking/lockdep.c:3261 [inline]
    validate_chain kernel/locking/lockdep.c:3876 [inline]
    __lock_acquire+0x2e3d/0x5de0 kernel/locking/lockdep.c:5144
    lock_acquire kernel/locking/lockdep.c:5761 [inline]
    lock_acquire+0x1ae/0x510 kernel/locking/lockdep.c:5726
    lock_sock_nested+0x3a/0xf0 net/core/sock.c:3492
    lock_sock include/net/sock.h:1708 [inline]
    raw_bind+0xb1/0xab0 net/can/raw.c:435
    __sys_bind+0x1ec/0x220 net/socket.c:1792
    __do_sys_bind net/socket.c:1803 [inline]
    __se_sys_bind net/socket.c:1801 [inline]
    __x64_sys_bind+0x72/0xb0 net/socket.c:1801
    do_syscall_x64 arch/x86/entry/common.c:50 [inline]
    do_syscall_64+0x38/0xb0 arch/x86/entry/common.c:80
    entry_SYSCALL_64_after_hwframe+0x63/0xcd
    RIP: 0033:0x7fd89007cb29
    Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 20 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
    RSP: 002b:00007fd890d2a0c8 EFLAGS: 00000246 ORIG_RAX: 0000000000000031
    RAX: ffffffffffffffda RBX: 00007fd89019bf80 RCX: 00007fd89007cb29
    RDX: 0000000000000010 RSI: 0000000020000040 RDI: 0000000000000003
    RBP: 00007fd8900c847a R08: 0000000000000000 R09: 0000000000000000
    R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000000
    R13: 000000000000000b R14: 00007fd89019bf80 R15: 00007ffebf8124f8
    </TASK>
    
    Fixes: ee8b94c8510c ("can: raw: fix receiver memory leak")
    Reported-by: syzbot <syzkaller@googlegroups.com>
    Signed-off-by: Eric Dumazet <edumazet@google.com>
    Cc: Ziyang Xuan <william.xuanziyang@huawei.com>
    Cc: Oliver Hartkopp <socketcan@hartkopp.net>
    Cc: stable@vger.kernel.org
    Cc: Marc Kleine-Budde <mkl@pengutronix.de>
    Link: https://lore.kernel.org/all/20230720114438.172434-1-edumazet@google.com
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit c8ddbaec835a332fb597780c3b165a505584a403
Author: Ziyang Xuan <william.xuanziyang@huawei.com>
Date:   Tue Jul 11 09:17:37 2023 +0800

    can: raw: fix receiver memory leak
    
    [ Upstream commit ee8b94c8510ce64afe0b87ef548d23e00915fb10 ]
    
    Got kmemleak errors with the following ltp can_filter testcase:
    
    for ((i=1; i<=100; i++))
    do
            ./can_filter &
            sleep 0.1
    done
    
    ==============================================================
    [<00000000db4a4943>] can_rx_register+0x147/0x360 [can]
    [<00000000a289549d>] raw_setsockopt+0x5ef/0x853 [can_raw]
    [<000000006d3d9ebd>] __sys_setsockopt+0x173/0x2c0
    [<00000000407dbfec>] __x64_sys_setsockopt+0x61/0x70
    [<00000000fd468496>] do_syscall_64+0x33/0x40
    [<00000000b7e47d51>] entry_SYSCALL_64_after_hwframe+0x61/0xc6
    
    It's a bug in the concurrent scenario of unregister_netdevice_many()
    and raw_release() as following:
    
                 cpu0                                        cpu1
    unregister_netdevice_many(can_dev)
      unlist_netdevice(can_dev) // dev_get_by_index() return NULL after this
      net_set_todo(can_dev)
                                                    raw_release(can_socket)
                                                      dev = dev_get_by_index(, ro->ifindex); // dev == NULL
                                                      if (dev) { // receivers in dev_rcv_lists not free because dev is NULL
                                                        raw_disable_allfilters(, dev, );
                                                        dev_put(dev);
                                                      }
                                                      ...
                                                      ro->bound = 0;
                                                      ...
    
    call_netdevice_notifiers(NETDEV_UNREGISTER, )
      raw_notify(, NETDEV_UNREGISTER, )
        if (ro->bound) // invalid because ro->bound has been set 0
          raw_disable_allfilters(, dev, ); // receivers in dev_rcv_lists will never be freed
    
    Add a net_device pointer member in struct raw_sock to record bound
    can_dev, and use rtnl_lock to serialize raw_socket members between
    raw_bind(), raw_release(), raw_setsockopt() and raw_notify(). Use
    ro->dev to decide whether to free receivers in dev_rcv_lists.
    
    Fixes: 8d0caedb7596 ("can: bcm/raw/isotp: use per module netdevice notifier")
    Reviewed-by: Oliver Hartkopp <socketcan@hartkopp.net>
    Acked-by: Oliver Hartkopp <socketcan@hartkopp.net>
    Signed-off-by: Ziyang Xuan <william.xuanziyang@huawei.com>
    Link: https://lore.kernel.org/all/20230711011737.1969582-1-william.xuanziyang@huawei.com
    Cc: stable@vger.kernel.org
    Signed-off-by: Marc Kleine-Budde <mkl@pengutronix.de>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 019b59aeb2af6b47d5c8e69c5dc1d731c8df0354
Author: Zhang Yi <yi.zhang@huawei.com>
Date:   Tue Jun 6 21:59:27 2023 +0800

    jbd2: fix a race when checking checkpoint buffer busy
    
    [ Upstream commit 46f881b5b1758dc4a35fba4a643c10717d0cf427 ]
    
    Before removing checkpoint buffer from the t_checkpoint_list, we have to
    check both BH_Dirty and BH_Lock bits together to distinguish buffers
    have not been or were being written back. But __cp_buffer_busy() checks
    them separately, it first check lock state and then check dirty, the
    window between these two checks could be raced by writing back
    procedure, which locks buffer and clears buffer dirty before I/O
    completes. So it cannot guarantee checkpointing buffers been written
    back to disk if some error happens later. Finally, it may clean
    checkpoint transactions and lead to inconsistent filesystem.
    
    jbd2_journal_forget() and __journal_try_to_free_buffer() also have the
    same problem (journal_unmap_buffer() escape from this issue since it's
    running under the buffer lock), so fix them through introducing a new
    helper to try holding the buffer lock and remove really clean buffer.
    
    Link: https://bugzilla.kernel.org/show_bug.cgi?id=217490
    Cc: stable@vger.kernel.org
    Suggested-by: Jan Kara <jack@suse.cz>
    Signed-off-by: Zhang Yi <yi.zhang@huawei.com>
    Reviewed-by: Jan Kara <jack@suse.cz>
    Link: https://lore.kernel.org/r/20230606135928.434610-6-yi.zhang@huaweicloud.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 557fda9ed70ebf8eda2620ba3d746215285a1303
Author: Zhang Yi <yi.zhang@huawei.com>
Date:   Tue Jun 6 21:59:25 2023 +0800

    jbd2: remove journal_clean_one_cp_list()
    
    [ Upstream commit b98dba273a0e47dbfade89c9af73c5b012a4eabb ]
    
    journal_clean_one_cp_list() and journal_shrink_one_cp_list() are almost
    the same, so merge them into journal_shrink_one_cp_list(), remove the
    nr_to_scan parameter, always scan and try to free the whole checkpoint
    list.
    
    Signed-off-by: Zhang Yi <yi.zhang@huawei.com>
    Reviewed-by: Jan Kara <jack@suse.cz>
    Link: https://lore.kernel.org/r/20230606135928.434610-4-yi.zhang@huaweicloud.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Stable-dep-of: 46f881b5b175 ("jbd2: fix a race when checking checkpoint buffer busy")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 2968fec1d56f1131d3b7b69348e6397f5042521e
Author: Zhang Yi <yi.zhang@huawei.com>
Date:   Tue Jun 6 21:59:24 2023 +0800

    jbd2: remove t_checkpoint_io_list
    
    [ Upstream commit be22255360f80d3af789daad00025171a65424a5 ]
    
    Since t_checkpoint_io_list was stop using in jbd2_log_do_checkpoint()
    now, it's time to remove the whole t_checkpoint_io_list logic.
    
    Signed-off-by: Zhang Yi <yi.zhang@huawei.com>
    Reviewed-by: Jan Kara <jack@suse.cz>
    Link: https://lore.kernel.org/r/20230606135928.434610-3-yi.zhang@huaweicloud.com
    Signed-off-by: Theodore Ts'o <tytso@mit.edu>
    Stable-dep-of: 46f881b5b175 ("jbd2: fix a race when checking checkpoint buffer busy")
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 1d9995c2ac8060f005d206ab93e155de1fc2339e
Author: Igor Mammedov <imammedo@redhat.com>
Date:   Mon Apr 24 21:15:57 2023 +0200

    PCI: acpiphp: Reassign resources on bridge if necessary
    
    [ Upstream commit 40613da52b13fb21c5566f10b287e0ca8c12c4e9 ]
    
    When using ACPI PCI hotplug, hotplugging a device with large BARs may fail
    if bridge windows programmed by firmware are not large enough.
    
    Reproducer:
      $ qemu-kvm -monitor stdio -M q35  -m 4G \
          -global ICH9-LPC.acpi-pci-hotplug-with-bridge-support=on \
          -device id=rp1,pcie-root-port,bus=pcie.0,chassis=4 \
          disk_image
    
     wait till linux guest boots, then hotplug device:
       (qemu) device_add qxl,bus=rp1
    
     hotplug on guest side fails with:
       pci 0000:01:00.0: [1b36:0100] type 00 class 0x038000
       pci 0000:01:00.0: reg 0x10: [mem 0x00000000-0x03ffffff]
       pci 0000:01:00.0: reg 0x14: [mem 0x00000000-0x03ffffff]
       pci 0000:01:00.0: reg 0x18: [mem 0x00000000-0x00001fff]
       pci 0000:01:00.0: reg 0x1c: [io  0x0000-0x001f]
       pci 0000:01:00.0: BAR 0: no space for [mem size 0x04000000]
       pci 0000:01:00.0: BAR 0: failed to assign [mem size 0x04000000]
       pci 0000:01:00.0: BAR 1: no space for [mem size 0x04000000]
       pci 0000:01:00.0: BAR 1: failed to assign [mem size 0x04000000]
       pci 0000:01:00.0: BAR 2: assigned [mem 0xfe800000-0xfe801fff]
       pci 0000:01:00.0: BAR 3: assigned [io  0x1000-0x101f]
       qxl 0000:01:00.0: enabling device (0000 -> 0003)
       Unable to create vram_mapping
       qxl: probe of 0000:01:00.0 failed with error -12
    
    However when using native PCIe hotplug
      '-global ICH9-LPC.acpi-pci-hotplug-with-bridge-support=off'
    it works fine, since kernel attempts to reassign unused resources.
    
    Use the same machinery as native PCIe hotplug to (re)assign resources.
    
    Link: https://lore.kernel.org/r/20230424191557.2464760-1-imammedo@redhat.com
    Signed-off-by: Igor Mammedov <imammedo@redhat.com>
    Signed-off-by: Bjorn Helgaas <bhelgaas@google.com>
    Acked-by: Michael S. Tsirkin <mst@redhat.com>
    Acked-by: Rafael J. Wysocki <rafael@kernel.org>
    Cc: stable@vger.kernel.org
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit a7342df30797b8e62d2c67cf8d7a34f4ecfbd875
Author: Chuck Lever <chuck.lever@oracle.com>
Date:   Mon Jul 3 14:18:29 2023 -0400

    xprtrdma: Remap Receive buffers after a reconnect
    
    [ Upstream commit 895cedc1791916e8a98864f12b656702fad0bb67 ]
    
    On server-initiated disconnect, rpcrdma_xprt_disconnect() was DMA-
    unmapping the Receive buffers, but rpcrdma_post_recvs() neglected
    to remap them after a new connection had been established. The
    result was immediate failure of the new connection with the Receives
    flushing with LOCAL_PROT_ERR.
    
    Fixes: 671c450b6fe0 ("xprtrdma: Fix oops in Receive handler after device removal")
    Signed-off-by: Chuck Lever <chuck.lever@oracle.com>
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit ef65498c8087bfe87bec01299b30ae7d683d1ccd
Author: Fedor Pchelkin <pchelkin@ispras.ru>
Date:   Tue Jul 25 14:59:30 2023 +0300

    NFSv4: fix out path in __nfs4_get_acl_uncached
    
    [ Upstream commit f4e89f1a6dab4c063fc1e823cc9dddc408ff40cf ]
    
    Another highly rare error case when a page allocating loop (inside
    __nfs4_get_acl_uncached, this time) is not properly unwound on error.
    Since pages array is allocated being uninitialized, need to free only
    lower array indices. NULL checks were useful before commit 62a1573fcf84
    ("NFSv4 fix acl retrieval over krb5i/krb5p mounts") when the array had
    been initialized to zero on stack.
    
    Found by Linux Verification Center (linuxtesting.org).
    
    Fixes: 62a1573fcf84 ("NFSv4 fix acl retrieval over krb5i/krb5p mounts")
    Signed-off-by: Fedor Pchelkin <pchelkin@ispras.ru>
    Reviewed-by: Benjamin Coddington <bcodding@redhat.com>
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

commit 9e2388d814ac7a93ca75ec9ed3602c37b760a8d3
Author: Fedor Pchelkin <pchelkin@ispras.ru>
Date:   Tue Jul 25 14:58:58 2023 +0300

    NFSv4.2: fix error handling in nfs42_proc_getxattr
    
    [ Upstream commit 4e3733fd2b0f677faae21cf838a43faf317986d3 ]
    
    There is a slight issue with error handling code inside
    nfs42_proc_getxattr(). If page allocating loop fails then we free the
    failing page array element which is NULL but __free_page() can't deal with
    NULL args.
    
    Found by Linux Verification Center (linuxtesting.org).
    
    Fixes: a1f26739ccdc ("NFSv4.2: improve page handling for GETXATTR")
    Signed-off-by: Fedor Pchelkin <pchelkin@ispras.ru>
    Reviewed-by: Benjamin Coddington <bcodding@redhat.com>
    Signed-off-by: Trond Myklebust <trond.myklebust@hammerspace.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>