Constant BSOD crashes

Page 1 of 11 123 ... LastLast

  1. Posts : 47
    Windos 10 x64
       #1

    Constant BSOD crashes


    Hey everyone, for the past 8 month (since I got this new computer) I've been getting constant BSODS (usually IRQL NOT LESS OR EQUAL) and I'm not able to locate the cause.

    I have run memtest86 and no errors occurred

    I tried switching drivers, and still the bsod's continue...

    In the past few weeks I've been also getting (CRITICAL PROCESS DIED) and what no really..

    Thanks in advance for your help!

    Windows Version: 20H2

    OS build: 19042.572

    Link to the verifier .zip - google drive:
    Verifier-Log - Google Drive
      My Computer


  2. Posts : 1,538
    Windows 8.1 Enterprise x64
       #2
      My Computer


  3. Posts : 47
    Windos 10 x64
    Thread Starter
       #3

    MrPepka said:
    Thank you for your quick response -
    I'll update the drivers and hope that this will fix the problem :)

    I'd really appreciate it if you can elaborate on how you understood this was the problem, since I tried to debug it myself and couldn't find the drivers that are causing the problem.

    - - - Updated - - -

    Unfortunately after updating both drivers, I got another BSOD today
    A new type though - unexpected kernel mode trap.

    Here's the log zip:
    DESKTOP-761T659-(2020-10-22_17-21-49).zip - Google Drive

    It's in the same folder as the link in the original post...

    Thanks in advance for any leads...
      My Computer


  4. Posts : 14,046
    Windows 11 Pro X64 22H2 22621.1848
       #4

    Are you doing any overclocking, over/under volting? If you are, reset everything to nominal.
    If you are using memory XMP profile, disable for now.

    The 10/19 8:52AM dump shows this:
    Code:
    INTERNAL_POWER_ERROR (a0)
    The power policy manager experienced a fatal error.
    Arguments:
    Arg1: 0000000000000107, A data mismatch has occurred in the internal hibernation data
        structures.
    Arg2: 000000000000000a
    Arg3: ffffd2852d3dc6d0, POP_HIBER_CONTEXT
    Arg4: 0000000000000000, PO_MEMORY_RANGE_ARRAY
    This can be caused by a variety of things, memory, hard drive, PSU or CPU.

    The first thing I'd do is disable Hibernate which will also disable Fast Startup which is another thing I want you to do anyway,

    Open a Elevated Command Prompt.
    Type powercfg -h off and press Enter
    Close the Command Prompt window.

    See if that makes any difference.

    The 10/22 5:20PM dump has this:
    Code:
    UNEXPECTED_KERNEL_MODE_TRAP (7f)
    This means a trap occurred in kernel mode, and it's a trap of a kind
    that the kernel isn't allowed to have/catch (bound trap) or that
    is always instant death (double fault).  The first number in the
    bugcheck params is the number of the trap (8 = double fault, etc)
    Consult an Intel x86 family manual to learn more about what these
    traps are. Here is a *portion* of those codes:
    If kv shows a taskGate
            use .tss on the part before the colon, then kv.
    Else if kv shows a trapframe
            use .trap on that value
    Else
            .trap on the appropriate frame will show where the trap was taken
            (on x86, this will be the ebp that goes with the procedure KiTrap)
    Endif
    kb will then show the corrected stack.
    Arguments:
    Arg1: 0000000000000008, EXCEPTION_DOUBLE_FAULT
    Arg2: fffff80481d07e70
    Arg3: fffff60052b02000
    Arg4: fffff8047eaac788

    A lot of information here that I've found but I thought you might like to have it:


    Bug Check 0x7F: UNEXPECTED_KERNEL_MODE_TRAP

    The UNEXPECTED_KERNEL_MODE_TRAP bug check has a value of 0x0000007F. This bug check indicates that the CPU generated a trap and the kernel failed to catch this trap. This trap could be a bound trap (a trap the kernel is not permitted to catch) or a double fault (a fault that occurred while processing an earlier fault, which always results in a system failure).

    0x00000008, or Double Fault, indicates that an exception occurs during a call to the handler for a prior exception. Typically, the two exceptions are handled serially. However, there are several exceptions that cannot be handled serially, and in this situation the processor signals a double fault. There are two common causes of a double fault:

    • A kernel stack overflow. This overflow occurs when a guard page is hit, and the kernel tries to push a trap frame. Because there is no stack left, a stack overflow results, causing the double fault. If you think this overview has occurred, use !thread to determine the stack limits, and then use kb (Display Stack Backtrace) with a large parameter (for example, kb 100) to display the full stack.
    • A hardware problem.


    Cause

    Bug check 0x7F typically occurs after you install a faulty or mismatched hardware (especially memory) or if installed hardware fails.
    A double fault can occur when the kernel stack overflows. This overflow occurs if multiple drivers are attached to the same stack. For example, if two file system filter drivers are attached to the same stack and then the file system recurses back in, the stack overflows.

    Troubleshooting: If you recently added hardware to the computer, remove it to see if the error recurs. If existing hardware has failed, remove or replace the faulty component. Run hardware diagnostics that the system manufacturer supplies to determine which hardware component failed.

    The memory scanner is especially important. Faulty or mismatched memory can cause this bug check. For more information about these procedures, see the owner's manual for your computer. Check that all adapter cards in the computer are properly seated. Use an ink eraser or an electrical contact treatment, available at electronics supply stores, to ensure adapter card contacts are clean.

    If the error appears on a newly installed system, check the availability of updates for the BIOS, the SCSI controller, or network cards. These kind of updates are typically available on the Web site or BBS of the hardware manufacturer.

    Confirm that all hard disk drives, hard disk controllers, and SCSI adapters are compatible with the installed version of Windows. For example, you can get information about compatibility with Windows 7 at the Windows 7 Compatibility Center.

    If the error occurred after the installation of a new or updated device driver, you should remove or replace the driver. If, under this circumstance, the error occurs during the startup sequence and the system partition is formatted with NTFS, you might be able to use Safe Mode to rename or delete the faulty driver. If the driver is used as part of the system startup process in Safe Mode, you have to start the computer by using the Recovery Console in order to access the file.

    Also restart your computer, and then press F8 at the character-based menu that displays the operating system choices. At the Advanced Options menu, select the Last Known Good Configuration option. This option is most effective when you add only one driver or service at a time.
    Overclocking (setting the CPU to run at speeds above the rated specification) can cause this error. If you have overclocked the computer that is experiencing the error, return the CPU to the default clock speed setting.

    Check the System Log in Event Viewer for additional error messages that might help identify the device or driver that is causing the error. You can also disable memory caching of the BIOS to try to resolve the problem.

    If you encountered this error while upgrading to a new version of the Windows operating system, the error might be caused by a device driver, a system service, a virus scanner, or a backup tool that is incompatible with the new version. If possible, remove all third-party device drivers and system services and disable any virus scanners before you upgrade. Contact the software manufacturer to obtain updates of these tools. Also make sure that you have installed the latest Windows Service Pack.
    Finally, if all the above steps do not resolve the error, take the system motherboard to a repair facility for diagnostic testing. A crack, a scratched trace, or a defective component on the motherboard can also cause this error.


    So, in addition, it would be a very good idea to run memtestx86. You have 64GB of ram so this is going to take a very long time.
    A better method might be to remove half of the memory from alternates slots and run the system as usual and see if problems occur for a week or so.
    If not, swap installed and removed ram and test for another week.
    If you get problems with either set of ram modules, then remove one of the two and test.
    Swap as above until you narrow it down to a failing single memory module if possible.

    This will take time but this way you will be able to continue to use the system as normal. If you prefer to run memtest86, here is the procedure:

    ===================================================
    Up through DDR3, follow this tutorial: MemTest86+ - Test RAM - Windows 10 Forums
    DDR4, follow this tutorial: MemTest86 v8.1 Free Memory Testing Tool Review

    Both of these are diagnostic tools designed to test Random Access Memory (RAM) for faults. They will verify that:


    • RAM will accept and keep random patterns of data sent to it
    • There are no errors when different parts of memory try to interact
    • There are no conflicts between memory addresses

    They run from bootable media to isolate the RAM from the system, no other components are taken into account during the test.

    warning   Warning
    MemTest86+ and Memtest86 need to run for at least 8 passes to be anywhere near conclusive, anything less will not give a complete analysis of the RAM. The new version of Memtest86 only allows 4 passes for the free version so run it twice, back to back.

    If you are asked to run MemTest86+ or Memtest86 by a Ten Forums member make sure you run the full 8 (or 4+4) passes for conclusive results. If you run less than 8 passes you will be asked to run it again.

    Note   Note
    Both of these diagnostics has been known to discover errors in RAM in later passes than the eighth pass. This is for information only; if you feel there is a definite problem with the RAM and 8 passes have shown no errors feel free to continue for longer.

    Running 8 passes of MemTest86+ or 4+4 passes of Memtest86 is a long and drawn out exercise and the more RAM you have the longer it will take. It's recommended to run it overnight, starting just before you go to bed and leave it overnight.

    Take a picture when done and post in the forum please.
      My Computers


  5. Posts : 47
    Windos 10 x64
    Thread Starter
       #5

    First of all, thanks for the detailed response Ztruker.

    Turning the computer on today caused another BSOD (IRQL NOT LESS OR EQUAL) after opening a browser so I'm adding the log here...
    Link to Log: DESKTOP-761T659-(2020-10-23_09-41-19).zip - Google Drive

    I am not overclocking anything, except updating the bios 2 months ago when the last update was available I didn't touch any settings - I'll try to return to the settings as you recommended.

    I turned off the fast start up from the command line and hope it'll help.

    I'll remove two of the memory sticks and see if that helps... And I'll run MemTest 86 on the two sticks that are on the computer.
      My Computer


  6. Posts : 14,046
    Windows 11 Pro X64 22H2 22621.1848
       #6

    Great, good plan. I'll look at the latest dump.

    Edit: Not seeing anything new in the dump or associated files.

    If testing ram doesn't show anything then need to test CPU, GPU and Hard Drive. There are no motherboard tests unfortunately.
    We can also enable Driver Verifier to see if a device driver is causing the problem.

    I'd say take em in order:

    ===================================================
    Follow this tutorial to stress test your CPU:

    Prime95 - Stress Test Your CPU - Windows 10 Forums

    OCCT: OCBASE / OCCT

    For both the above, read the instructions carefully and watch your system when testing.
    If it starts spitting out problems and you can't shut it down - simply turn the system off/unplug it from the wall

    For Intel CPUs (32 and 64 bit): Intel® Processor Diagnostic Tool

    The purpose of the Intel® Processor Diagnostic Tool is to verify the functionality of an Intel® microprocessor. The diagnostic tool checks for brand identification, verifies the processor operating frequency, tests specific processor features, and performs a stress test on the processor.

    ===================================================
    Follow this tutorials to stress test your GPU:
    FurMark - GPU Stress Test - Windows 10 Forums

    If you have a AMD APU instead of a discreet GPU, try UNIGINE 3D Benchmark..

    Read the instructions carefully and watch your system when testing.
    If it starts spitting out problems and you can't shut it down - simply turn the system off/unplug it from the wall

    ===================================================
    Run Driver Verifier

    Driver Verifier-- tracking down a mis-behaving driver.

    Following the instructions in the above link will check for problems in all non-Microsoft drivers.

    What we're looking for is a verifier generated BSOD with a mini dump that will tell us what driver caused it. If you get a BSOD, rerun the V2 log collector as soon as possible and upload the resulting zip file. Also see if there is a new C:\Windows \MEMORY.DMP file. If there is, copy it to another location then zip it an upload to a file sharing site like OneDrive and post a link to it here.

    To check if verifier is active, open a Command Prompt and enter:

    verifier /query

    If not active it will respond with

    No drivers are currently verified.

    Follow instructions here to decide which drivers you want to test. 3-5 is a good start. Add 3-5 more every two hours until all are selected or you get a BSOD.

    Enable and Disable Driver Verifier in Windows 10

    Familiarize yourself with how to get out of Driver verifier from the above link and from the first link.

    ===================================================

    Diagnostics Test
    HDD TEST

    Run HDTune to

    • check the health,
    • scan for errors, not quick scan but full scan
    • run a benchmark.

    Run CrystalDiskInfo Portable and CrystalDiskMark Portable to

    • Check health of drive
    • Check performance of drive

    Run SeaGate SeaTools for Windows (test ALL hard drives - HDD/SSD/USB/etc)

    You can also try the various options available with chkdsk.
    Open a Elevated Command Prompt.
    Type chkdsk C: /F /R and press Enter.

    You'll get the following message:
    Chkdsk cannot run because the volume is in use by another process.
    Would you like to schedule this volume to be checked the next time the system restarts? (Y/N)

    Type Y and press Enter.
    Reboot and chkdsk will run. Let it run to completion.

    It can take awhile depending on the size of the disk. It's difficult to capture the chkdsk output so once the system is booted, download this batch file and run it:

    chkdskFromEvent.bat - Click here to go to the BSOD batch repository to download and run this batch file.

    Work through the above as time permits. Post results in the forum, screen captures or cell phone pictures.
    Last edited by Ztruker; 23 Oct 2020 at 13:56.
      My Computers


  7. Posts : 47
    Windos 10 x64
    Thread Starter
       #7

    Unfortunately - another crash.
    Things I did since the last post:
    - Took 2 of the 4 ram sticks out.
    - ran a GPU stress test, no crash during the test (temps stabilized at around 77 degrees at 100% power)
    - tonight I'll run the free MemTest 86 test again on these 2 RAM sticks... if needed I'll buy the pro version.

    Attaching the log here:
    DESKTOP-761T659-(2020-10-26_21-33-02).zip - Google Drive

    Thanks in advance for any leads :)
      My Computer


  8. Posts : 14,046
    Windows 11 Pro X64 22H2 22621.1848
       #8

    Dump is messed up, getting a lot of Symbol file problems.

    Please try Driver Verifier next when you get time.
      My Computers


  9. Posts : 47
    Windos 10 x64
    Thread Starter
       #9

    First, thanks - I'll turn Driver verifier tonight.
    But this morning when I woke the computer from sleep mode it instantly BSOD saying the problematic driver was storport.sys
    now I can't upload the log - because the dump file is all messed up again...
    Any method you know of to fix the symbol warning that messes up these dump files?
      My Computer


  10. Posts : 41,475
    windows 10 professional version 1607 build 14393.969 64 bit
       #10

    Please run: Tuneup.bat - Click here to go to the BSOD batch repository to download and run this batch file.



    In the left lower corner search type: system or system control > open system control panel > on the left pane click advanced system settings

    a) > on the advanced tab under startup and recovery > click settings > post an image of the startup and recovery window into the thread

    b) > on the advanced tab under performance > click on settings > on the performance options window > click on the advanced tab > under virtual memory > click on change > post an image of the virtual memory window into the thread



    For any BSOD:

    a) run the V2 log collector to collect new log files

    b) open file explorer> this PC > C: > in the right upper corner search for: C:\Windows\memory.dmp
    > if the file size is < 1.5 GB then zip > post a separate share link into the thread using one drive, drop box, or google drive
      My Computer


 

  Related Discussions
Our Sites
Site Links
About Us
Windows 10 Forums is an independent web site and has not been authorized, sponsored, or otherwise approved by Microsoft Corporation. "Windows 10" and related materials are trademarks of Microsoft Corp.

© Designer Media Ltd
All times are GMT -5. The time now is 12:47.
Find Us




Windows 10 Forums