Visit Open-E website
Results 1 to 10 of 27

Thread: Kernel error kernel:[360783.353677]

Hybrid View

Previous Post Previous Post   Next Post Next Post
  1. #1
    Join Date
    Oct 2009
    Posts
    53

    Default

    Ok, I turned off all power options in the BIOS.
    Indeed it seemed to be the DOM: today the system froze again, and now it won't even boot beyond the "loading" screen.
    So I unplugged the DOM and took an external USB drive and installed the software on that without any issue.

    Unfortunately, after finishing the install and rebooting, it hangs on "loading" again, so I tried booting the old DOM in v6, but to no avail...
    I had the RAID array checked during the night: zero errors.
    But still: no DSS version is loading succesfully now.

    So I took out all hardware and rebooted, et voila: it ran again.
    I plugged all hardware in, changed the slots of the NICs to avoid any IRQ errors and booted succesfully again.
    Apparently moving the NIC was helpful.

    After that, I cleared the TDB Database and reconnected to the DC and got it running.
    However, I faced once that the machine hang during booting (just after "init runlevel 2"), so I'll order a new DOM just to be sure...
    Last edited by Arcesilaus; 10-02-2012 at 10:43 AM. Reason: update

  2. #2
    Join Date
    Oct 2009
    Posts
    53

    Default

    Unfortunately, it is time for an update.

    Having succesfully updated to v7, I am still experiencing system freezes that drive me nuts.

    I looked in the dmesg.2 logfiles from the downloads I took after reboot, and in 2 out of 3 cases, no notification of any error was found.
    The last time, a reference is made to the onboard SATA controller:

    [ 663.417882] ata1: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen
    [ 663.417885] ata1: irq_stat 0x00000040, connection status changed
    [ 663.417887] ata1: SError: { DevExch }
    [ 663.417893] ata1: hard resetting link
    [ 664.165187] ata1: SATA link down (SStatus 0 SControl 300)
    [ 664.185147] ata1: EH complete
    [ 899.498772] ata1: exception Emask 0x10 SAct 0x0 SErr 0x4000000 action 0xe frozen
    [ 899.498776] ata1: irq_stat 0x00000040, connection status changed
    [ 899.498779] ata1: SError: { DevExch }
    [ 899.498787] ata1: hard resetting link

    However, no disk is connected to that port (only a single SSD on ata2), as can also be seen in earlier in the same logfile:

    5.547281] NET: Registered protocol family 5
    [ 5.638323] RPC: Registered rdma transport module.
    [ 5.907834] ata1: SATA link down (SStatus 0 SControl 300)
    [ 6.297160] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
    [ 6.298030] ata2.00: ACPI cmd ef/10:06:00:00:00:00 (SET FEATURES) succeeded
    [ 6.298033] ata2.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
    [ 6.298035] ata2.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
    [ 6.298566] ata2.00: ATA-9: OCZ-VERTEX4, 1.5, max UDMA/133
    [ 6.298569] ata2.00: 500118192 sectors, multi 16: LBA48 NCQ (depth 31/32), AA
    [ 6.299425] ata2.00: ACPI cmd ef/10:06:00:00:00:00 (SET FEATURES) succeeded
    [ 6.299427] ata2.00: ACPI cmd f5/00:00:00:00:00:00 (SECURITY FREEZE LOCK) filtered out
    [ 6.299429] ata2.00: ACPI cmd b1/c1:00:00:00:00:00 (DEVICE CONFIGURATION OVERLAY) filtered out
    [ 6.299968] ata2.00: configured for UDMA/133

    After experiencing issues with another SSD (previously attached to ata1), I replaced almost all hardware and replaced the SSD with the one that is now attached to ata2.
    The SSD is used as an iSCSI target (file i/o) and hosts a Windows 2012 VM (vSphere) with a small SQL 2012 database.

    Strangely enough, I've ran this exact config for over 2 years without any problem, until 2 months ago.
    Is there anyone that has a clue?

  3. #3
    Join Date
    Oct 2010
    Location
    GA
    Posts
    935

    Default

    Did you do this:
    Try to disable APM and ACPI in CTRL+ALT+T menu(in console mode ).ctrl+alt+t->boot options(9)->Boot parameters(1).

  4. #4
    Join Date
    Oct 2009
    Posts
    53

    Default

    I did, and I also disabled all ACPI functions in the machine's BIOS.
    Furthermore, I've replaced the power supply yesterday and moved the SSD to the Areca Controller, to see if the onboard SATA controller might be the problem.

    By the way -just for the information of others-, after moving the disk from the onboard controller to the Areca controller, another reboot was required before I could reconnect the Volume to the iSCSI target, but it worked surprisingly well.

    Unfortunately, this morning the machine went down again, and not being home, I haven't had a chance to reset the machine and look at the logs.
    Tonight, I'll bring the machine back up, check the ACPI settings, examine the log and post my findings here.

    Are there any logs beside the dmesg.2 and critial_errors.log (so far: this was always empty) files that I should examine?

    P.s. I found two other forum threads in which similar behavior is mentioned: here and here. With reference to the first, recalling when the freezes started, I think it was after upgrading above v6 update 90.
    Unfortunately, in neither thread a final outcome is reported.
    Last edited by Arcesilaus; 10-25-2012 at 03:46 PM. Reason: related threads found

  5. #5
    Join Date
    Aug 2010
    Posts
    404

    Default

    One of the links show a solution by replacing disk(s) with errors, after that the system start to work fine.
    So we recommend you to check your controller health and to check the storage disks, then to run file system repair tool to fix your FS if it contain any issue.
    Please check this KB: http://kb.open-e.com/File-system-repair_138.html

    Also please be sure that your controller Firmware is updated to the latest firmware available at your controller website.

  6. #6
    Join Date
    Oct 2009
    Posts
    53

    Default

    Thanks for the hints: I've checked the controller's and disks' health and no issue was found. All are running the latest firmware builds.
    However, I found that the ACPI features were still turned on in the boot options - I guess I turned them off before upgrading to v7 and forgot to check afterwards.
    I turned it off and rebooted.

    The machine is up and running again and I keep my fingers crossed...
    Thanks for your help and I'll keep this thread up to date so it might be helpful for other users (although maybe just the inexperienced users like me).

  7. #7
    Join Date
    Oct 2010
    Location
    GA
    Posts
    935

    Default

    FYI: driver settings, and boot settings and a few more, are not saved when going from DSS V6-->DSS V7. I thought this was in the release notes someplace. Guess I will have to check on that!

Tags for this Thread

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •