1) The BBU does not charge. The logs show: Controller BBU Thermal Shutdown/Enter Sleep-Mode !
I have read the article on esupport; the steps described there do not help. After N hours the message appears again.
2) I could get stable operation only in U160 mode. The controller is an Adaptec AIC7902 (integrated into the Supermicro AS1020A-8). The cable is the one bundled with the Eonstor, and I also tried another one. The Adaptec settings follow the esupport article. OS: Gentoo Linux, kernel 2.6.15. On any attempt to access the device, the logs start filling with errors at a furious rate and everything grinds to a halt. I already have 3.5 GB of such logs.
(scsi1:A:0:1): Probable outgoing LQ CRC error. Retrying command
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56639
Buffer I/O error on device sdd1, logical block 56576
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56640
Buffer I/O error on device sdd1, logical block 56577
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56641
Buffer I/O error on device sdd1, logical block 56578
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56642
Buffer I/O error on device sdd1, logical block 56579
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56643
Buffer I/O error on device sdd1, logical block 56580
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56644
Buffer I/O error on device sdd1, logical block 56581
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56645
Buffer I/O error on device sdd1, logical block 56582
sd 1:0:0:1: SCSI error: return code = 0x10000
end_request: I/O error, dev sdd, sector 56646
Buffer I/O error on device sdd1, logical block 56583
shutdown[6284]: shutting down for system reboot
init: Switching to runlevel: 6
sd 1:0:0:1: Attempting to queue an ABORT message:CDB: 0x28 0x0 0x0 0x0 0xdc 0xbf 0x0 0x0 0x75 0x0
scsi1: At time of recovery, card was not paused
>>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
scsi1: Dumping Card State at program address 0x12 Mode 0x33
Card was paused
HS_MAILBOX[0x0] INTCTL[0x80]:(SWTMINTMASK) SEQINTSTAT[0x0]
SAVED_MODE[0x11] DFFSTAT[0x33]:(CURRFIFO_NONE|FIFO0FREE|FIFO1FREE)
SCSISIGI[0x0]:(P_DATAOUT) SCSIPHASE[0x0] SCSIBUS[0x0]
LASTPHASE[0x1]:(P_DATAOUT|P_BUSFREE) SCSISEQ0[0x0]
SCSISEQ1[0x12]:(ENAUTOATNP|ENRSELI) SEQCTL0[0x0]
SEQINTCTL[0x0] SEQ_FLAGS[0x0] SEQ_FLAGS2[0x0] SSTAT0[0x0]
SSTAT1[0x8]:(BUSFREE) SSTAT2[0x0] SSTAT3[0x0] PERRDIAG[0xc0]:(HIPERR|HIZERO)
SIMODE1[0xa4]:(ENSCSIPERR|ENSCSIRST|ENSELTIMO)
LQISTAT0[0x0] LQISTAT1[0x0] LQISTAT2[0x0] LQOSTAT0[0x0]
LQOSTAT1[0x0] LQOSTAT2[0xe1]:(LQOSTOP0|LQOPKT)
SCB Count = 32 CMDS_PENDING = 32 LASTSCB 0x4 CURRSCB 0x19 NEXTSCB 0xff40
qinstart = 168 qinfifonext = 168
QINFIFO:
WAITING_TID_QUEUES:
Pending list:
25 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
26 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
4 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
27 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
12 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
18 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
2 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
0 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
15 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
31 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
16 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
19 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
22 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
23 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
5 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
8 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
28 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
24 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
10 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
7 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
6 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
1 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
11 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
20 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
21 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
17 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
14 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
13 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
3 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
9 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
30 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
29 FIFO_USE[0x0] SCB_CONTROL[0x60]:(TAG_ENB|DISCENB) SCB_SCSIID[0xe7]
Total 32
Kernel Free SCB list:
Sequencer Complete DMA-inprog list:
Sequencer Complete list:
Sequencer DMA-Up and Complete list:
scsi1: FIFO0 Free, LONGJMP == 0x823a, SCB 0x18
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x4]:(DIRECTION) DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
scsi1: FIFO1 Free, LONGJMP == 0x8063, SCB 0x3
SEQIMODE[0x3f]:(ENCFG4TCMD|ENCFG4ICMD|ENCFG4TSTAT|ENCFG4ISTAT|ENCFG4DATA|ENSAVEPTRS)
SEQINTSRC[0x0] DFCNTRL[0x0] DFSTATUS[0x89]:(FIFOEMP|HDONE|PRELOAD_AVAIL)
SG_CACHE_SHADOW[0x2]:(LAST_SEG) SG_STATE[0x0] DFFSXFRCTL[0x0]
SOFFCNT[0x0] MDFFSTAT[0x5]:(FIFOFREE|DLZERO) SHADDR = 0x00, SHCNT = 0x0
HADDR = 0x00, HCNT = 0x0 CCSGCTL[0x10]:(SG_CACHE_AVAIL)
LQIN: 0x4 0x0 0x0 0x18 0x0 0x1 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x4 0x0 0x0 0x0 0x0 0x0 0x0
scsi1: LQISTATE = 0x0, LQOSTATE = 0x0, OPTIONMODE = 0x52
scsi1: OS_SPACE_CNT = 0x20 MAXCMDCNT = 0x1
SIMODE0[0xc]:(ENOVERRUN|ENIOERR)
CCSCBCTL[0x4]:(CCSCBDIR)
scsi1: REG0 == 0xd, SINDEX = 0x152, DINDEX = 0x11e
scsi1: SCBPTR == 0x19, SCB_NEXT == 0xff40, SCB_NEXT2 == 0xfff4
CDB 2a 0 0 54 4 10
STACK: 0x0 0x0 0x0 0x0 0x0 0x0 0x0 0x0
<<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
(scsi1:A:14:1): Device is disconnected, re-queuing SCB
Recovery code sleeping
scsi1: ILLEGAL_PHASE 0x80
(scsi1:A:14:1): Abort Message Sent
Recovery code awake
Timer Expired
aic79xx_abort returns 0x2003
sd 1:0:14:1: Attempting to queue an ABORT message:CDB: 0x2a 0x0 0x0 0x34 0xc 0x10 0x0 0x4 0x0 0x0
scsi1: At time of recovery, card was not paused
Recovery SCB completes
(scsi1:A:14:1): Device is disconnected, re-queuing SCB
Recovery code sleeping
scheduling while atomic: scsi_eh_1/0xfffffffe/862
[<c04f1b81>] schedule+0xa91/0xd50
[<c011f5ad>] release_console_sem+0xbd/0xc0
[<c011f346>] vprintk+0x196/0x2b0
[<c03a5fc1>] ahd_flush_qoutfifo+0x41/0x1100
[<c04f2e15>] __down+0x75/0xe0
[<c011a5a0>] default_wake_function+0x0/0x20
[<c04f108f>] __down_failed+0x7/0xc
[<c03d0b9c>] .text.lock.aic79xx_osm+0x22/0x36
[<c03cec10>] ahd_linux_sem_timeout+0x0/0x50
[<c03cc578>] ahd_linux_abort+0x18/0x40
[<c039b18d>] scsi_eh_abort_cmds+0x3d/0xa0
[<c039bdc2>] scsi_unjam_host+0xb2/0xd0
[<c039bde0>] scsi_error_handler+0x0/0xb0
[<c039be7f>] scsi_error_handler+0x9f/0xb0
[<c0134b3d>] kthread+0xbd/0x100
[<c0134a80>] kthread+0x0/0x100
[<c0101225>] kernel_thread_helper+0x5/0x10
scheduling while atomic: scsi_eh_1/0xfffffffd/862
[<c04f1b81>] schedule+0xa91/0xd50
[<c011f5ad>] release_console_sem+0xbd/0xc0
[<c04f2e15>] __down+0x75/0xe0
[<c011a5a0>] default_wake_function+0x0/0x20
[<c04f108f>] __down_failed+0x7/0xc
[<c03d0b9c>] .text.lock.aic79xx_osm+0x22/0x36
[<c03cec10>] ahd_linux_sem_timeout+0x0/0x50
[<c03cc578>] ahd_linux_abort+0x18/0x40
[<c039b18d>] scsi_eh_abort_cmds+0x3d/0xa0
[<c039bdc2>] scsi_unjam_host+0xb2/0xd0
[<c039bde0>] scsi_error_handler+0x0/0xb0
[<c039be7f>] scsi_error_handler+0x9f/0xb0
[<c0134b3d>] kthread+0xbd/0x100
[<c0134a80>] kthread+0x0/0x100
[<c0101225>] kernel_thread_helper+0x5/0x10
Everything would be fine with the U160 setup, but I suspect there is still an underlying problem that may surface later.
3) Has anyone used devices larger than 2 TB in this combination? Judging by what I found on Google, the AIC drivers have a number of long-standing problems with this.
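To make the log above easier to read, here is a minimal Python sketch that decodes the two kinds of raw values it contains: the 32-bit `return code` and the 10-byte CDBs printed with the ABORT messages. The function names (`decode_result`, `decode_rw10`, `parse_cdb`) are my own and purely illustrative. The byte layout and the host-byte names are taken from the Linux SCSI midlayer header (`include/scsi/scsi.h`) as I understand it for 2.6 kernels, so treat the table as an assumption to verify against your kernel source.

```python
# Host-byte codes, assumed from include/scsi/scsi.h in 2.6-era kernels.
HOST_BYTE = {
    0x00: "DID_OK",
    0x01: "DID_NO_CONNECT",
    0x02: "DID_BUS_BUSY",
    0x03: "DID_TIME_OUT",
    0x04: "DID_BAD_TARGET",
    0x05: "DID_ABORT",
    0x06: "DID_PARITY",
    0x07: "DID_ERROR",
    0x08: "DID_RESET",
    0x09: "DID_BAD_INTR",
}


def decode_result(result):
    """Split a 32-bit SCSI result word into its four component bytes."""
    return {
        "status": result & 0xFF,          # raw SCSI status byte
        "msg": (result >> 8) & 0xFF,      # message byte
        "host": (result >> 16) & 0xFF,    # host adapter (DID_*) code
        "driver": (result >> 24) & 0xFF,  # driver byte
    }


def parse_cdb(text):
    """Turn a string like '0x28 0x0 ...' from the log into a list of ints."""
    return [int(tok, 16) for tok in text.split()]


def decode_rw10(cdb):
    """Decode a READ(10)/WRITE(10) CDB into (name, start LBA, block count)."""
    names = {0x28: "READ(10)", 0x2A: "WRITE(10)"}
    if len(cdb) != 10 or cdb[0] not in names:
        raise ValueError("not a 10-byte READ(10)/WRITE(10) CDB")
    lba = int.from_bytes(bytes(cdb[2:6]), "big")     # bytes 2..5, big-endian
    blocks = int.from_bytes(bytes(cdb[7:9]), "big")  # bytes 7..8, big-endian
    return names[cdb[0]], lba, blocks


if __name__ == "__main__":
    fields = decode_result(0x10000)
    print(fields, HOST_BYTE.get(fields["host"], "unknown"))
    print(decode_rw10(parse_cdb("0x28 0x0 0x0 0x0 0xdc 0xbf 0x0 0x0 0x75 0x0")))
    print(decode_rw10(parse_cdb("0x2a 0x0 0x0 0x34 0xc 0x10 0x0 0x4 0x0 0x0")))
```

Under these assumptions, `return code = 0x10000` has host byte 0x01 (DID_NO_CONNECT) with all other bytes zero, the first aborted command is a READ(10) of 117 blocks starting at LBA 56511, and the second is a WRITE(10) of 1024 blocks starting at LBA 3410960.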
scsi1 : Adaptec AIC79XX PCI-X SCSI HBA DRIVER, Rev 1.3.11
<Adaptec AIC7902 Ultra320 SCSI adapter>
aic7902: Ultra320 Wide Channel B, SCSI Id=7, PCI-X 101-133Mhz, 512 SCBs
Vendor: IFT Model: A24U-G2421 Rev: 342J
Type: Direct-Access ANSI SCSI revision: 03
target1:0:14: asynchronous.
scsi1:A:14:0: Tagged Queuing enabled. Depth 32
target1:0:14: Beginning Domain Validation
target1:0:14: wide asynchronous.
target1:0:14: FAST-160 WIDE SCSI 320.0 MB/s DT IU QAS PCOMP (6.25 ns, offset 127)
target1:0:14: Ending Domain Validation
Vendor: IFT Model: A24U-G2421 Rev: 342J
Type: Direct-Access ANSI SCSI revision: 05
scsi1:A:14:1: Tagged Queuing enabled. Depth 32
....... skip .........
sd 1:0:14:0: Attached scsi disk sdc
sdd : very big device. try to use READ CAPACITY(16).
sdd : READ CAPACITY(16) failed.
sdd : status=0, message=00, host=5, driver=00
sdd : use 0xffffffff as device size
SCSI device sdd: 4294967296 512-byte hdwr sectors (2199023 MB)
SCSI device sdd: drive cache: write back
sdd : very big device. try to use READ CAPACITY(16).
sdd : READ CAPACITY(16) failed.
sdd : status=0, message=00, host=5, driver=00
sdd : use 0xffffffff as device size
SCSI device sdd: 4294967296 512-byte hdwr sectors (2199023 MB)
SCSI device sdd: drive cache: write back
sdd: unknown partition table
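As a sanity check on the numbers in this log, here is a minimal Python sketch of the arithmetic behind the fallback size. READ CAPACITY(10) reports the last LBA in a 32-bit field, so 0xffffffff is the largest value it can carry; the log shows the kernel falling back to that value after READ CAPACITY(16) failed. The helper name `capacity_from_last_lba` is hypothetical, and the 512-byte sector size is taken from the log lines above.

```python
SECTOR_SIZE = 512  # bytes per sector, as reported in the log


def capacity_from_last_lba(last_lba, sector_size=SECTOR_SIZE):
    """Return (sector count, size in bytes) for a device whose last LBA is given."""
    sectors = last_lba + 1  # LBAs are numbered from 0
    return sectors, sectors * sector_size


if __name__ == "__main__":
    sectors, size_bytes = capacity_from_last_lba(0xFFFFFFFF)
    print(sectors)              # 4294967296, matches the sector count in the log
    print(size_bytes // 10**6)  # 2199023, matches "2199023 MB" (decimal megabytes)
    print(size_bytes / 2**40)   # 2.0, i.e. exactly 2 TiB
```

In other words, the size shown for sdd is not the real size of the logical drive but the 2 TiB ceiling of the 32-bit field, so anything beyond that boundary is not addressable until READ CAPACITY(16) succeeds.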
5) And one more thing, for completeness :) , the blue indicator in Slot23 never lights up under any circumstances.
So, who is to blame and what is to be done?