
Thread: Server hangs every few days

  1. #1

    Hello,

    I have uploaded the logs here: logs. I couldn't find any errors in them, but maybe you can take a look.

    Thanks

  2. #2

    Hey,

    Open a ticket via http://www.open-e.com/service-and-support/ . Please attach the logs downloaded via the WebGUI.

    Ja-B

  3. #3
    Join Date
    Oct 2010
    Location
    GA
    Posts
    935

    Looking at what you have provided, the following needs to be changed.
    You have:
    MaxBurstLength=1048576
    FirstBurstLength=262144

    Change to:
    MaxBurstLength=16776192
    FirstBurstLength=65536

    Also you need to make sure to change each target, not just the first one.

    Overall, these settings work well:
    maxRecvDataSegmentLen=262144
    MaxBurstLength=16776192
    Maxxmitdatasegment=262144
    FirstBurstLength=65536
    DataDigest=None
    maxoutstandingr2t=8
    InitialR2T=No
    ImmediateData=Yes
    headerDigest=None
    Wthreads=8

    I can't see the NIC settings in your upload, but you can also try jumbo frames for the NICs: http://kb.open-e.com/Does-Open-E-sup...Frames_28.html

    Make sure the initiators have matching settings, as a mismatch can cause the machine to seem locked/stalled.
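
To illustrate the matching-settings check, here is a minimal Python sketch that compares two sets of `Key=Value` iSCSI parameters and reports the ones that differ. It is only an illustration: the helper names `parse_settings` and `find_mismatches` are hypothetical, the initiator values are made up to show the old burst lengths still in place, and how you actually export the parameters from your target and initiators is not covered here.

```python
def parse_settings(text):
    """Parse 'Key=Value' lines into a dict with lower-cased keys."""
    settings = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip().lower()] = value.strip()
    return settings


def find_mismatches(target, initiator):
    """Return {key: (target_value, initiator_value)} for shared keys that differ."""
    return {
        key: (target[key], initiator[key])
        for key in sorted(target.keys() & initiator.keys())
        if target[key].lower() != initiator[key].lower()
    }


# Target values as recommended in this post.
TARGET = parse_settings("""
MaxBurstLength=16776192
FirstBurstLength=65536
InitialR2T=No
ImmediateData=Yes
""")

# Hypothetical initiator that still carries the old burst lengths.
INITIATOR = parse_settings("""
MaxBurstLength=1048576
FirstBurstLength=262144
InitialR2T=No
ImmediateData=Yes
""")

for key, (t_val, i_val) in find_mismatches(TARGET, INITIATOR).items():
    print(f"{key}: target={t_val} initiator={i_val}")
```

Keys are compared case-insensitively because the parameter names in this thread are written with mixed capitalisation. Run the comparison once per target and initiator pair, since every target needs the change.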

    But complete logs would give a better picture as to whether or not there are other issues.
    I can only see a few files in your link, not the whole package.

  4. #4

    Update: I have disabled some CPU power saving features in the BIOS, and it is running stable now.

  5. #5

    Unfortunately, the box crashed again last night. But now I have some errors in the log:

    Code:
    	2011-04-12 01:59:10	scsi cmnd aborted, scsi_cmnd(0xffff88012c6d4700), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf8,0x 0,0x... (0/1) 	 	
    	2011-04-12 01:58:40	scsi cmnd aborted, scsi_cmnd(0xffff88012c6d4340), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf7,0x 0,0x... (1/1) 	 	
    	2011-04-12 01:58:10	scsi cmnd aborted, scsi_cmnd(0xffff88012c6d45c0), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf6,0x 0,0x... (1/1) 	 	
    	2011-04-12 01:57:40	scsi cmnd aborted, scsi_cmnd(0xffff88012c6d40c0), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf5,0x 0,0x... (0/1)
    This points to a hardware error. The RAID controller has already been replaced, so it seems the backplane is not working properly.
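
For anyone who wants to summarise abort entries like the ones above, here is a rough Python sketch. It assumes the log format is exactly as pasted (timestamp, then `scsi cmnd aborted`, then a truncated `cmnd[...` byte list); the regular expression and the `parse_aborts` helper are hypothetical and not part of any Open-E tool. It also assumes the standard 16-byte SCSI CDB layout, in which opcode 0x8a is WRITE(16) and bytes 2 to 9 hold the big-endian logical block address.

```python
import re
from datetime import datetime

# The four entries from the post, with the trailing "(n/1)" counters omitted.
LOG = """\
2011-04-12 01:59:10 scsi cmnd aborted, scsi_cmnd(0xffff88012c6d4700), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf8,0x 0,0x...
2011-04-12 01:58:40 scsi cmnd aborted, scsi_cmnd(0xffff88012c6d4340), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf7,0x 0,0x...
2011-04-12 01:58:10 scsi cmnd aborted, scsi_cmnd(0xffff88012c6d45c0), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf6,0x 0,0x...
2011-04-12 01:57:40 scsi cmnd aborted, scsi_cmnd(0xffff88012c6d40c0), cmnd[0x8a,0x 0,0x 0,0x 0,0x 0,0x 1,0xb0,0x9e,0xf5,0x 0,0x...
"""

PATTERN = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\s+scsi cmnd aborted, "
    r"scsi_cmnd\((0x[0-9a-f]+)\), cmnd\[(.*?)\.\.\."
)


def parse_aborts(log):
    """Return (timestamp, cdb_bytes) tuples in chronological order."""
    events = []
    for match in PATTERN.finditer(log):
        when = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
        # Bytes are printed like "0x 1"; the last token is a bare, truncated "0x".
        tokens = (tok.replace("0x", "").strip() for tok in match.group(3).split(","))
        cdb = [int(tok, 16) for tok in tokens if tok]
        events.append((when, cdb))
    return sorted(events)


events = parse_aborts(LOG)
gaps = [(b[0] - a[0]).total_seconds() for a, b in zip(events, events[1:])]
opcodes = {cdb[0] for _, cdb in events}
lbas = [int.from_bytes(bytes(cdb[2:10]), "big") for _, cdb in events]

print("seconds between aborts:", gaps)
print("opcodes:", [hex(op) for op in sorted(opcodes)])
print("LBAs:", [hex(lba) for lba in lbas])
```

In this sample every aborted command is a write, and the aborts arrive at a regular interval at neighbouring block addresses. That does not by itself say which component is at fault.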

  6. #6
    Join Date
    Oct 2010
    Location
    GA
    Posts
    935

    Did you ever adjust the values I mentioned above?

  7. #7
    Join Date
    Aug 2010
    Posts
    404

    Please try the values that Gr-R mentioned. Also, did you check your RAID controller's health?

  8. #8

    The RAID controller sometimes times out accessing drives in the enclosure. It's not the same drive every time, so it must be something with the backplane.

    I've changed the iSCSI values mentioned on all initiators and targets this morning, so let's see how long it keeps running this time.
