устанавливаем новую физическую машину с 10 дисками (кроме диска ОС)
каждый диск на самом деле содержит файловую систему HDFS
после того, как мы установили Linux-машину с комплектом hadoop, мы запустили datanode и заметили множество ошибок из dmesg
так как:
EXT4-fs warning (device sdi): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
EXT4-fs (sdi): warning: mounting fs with errors, running e2fsck is recommended
так как ошибки есть на всем диске (ставим новые 10 дисков)
тогда это не логично, что все диски плохие, что мы подозреваем, это что-то между дисками и HW-машиной
как разъемы, материнская плата и т. д.
я прав со своим выводом?
больше деталей от dmesg
:
[ *CO*] ERST: Error Record Serialization Table (ERST) support is initialized.
[ *CO*] ACPI Error: No handler for Region [SYSI] (ffff883fd24f87e0) [IPMI] (*N47*/evregion-162)
[ *CO*] ACPI Error: Region IPMI (ID=7) has no handler (*N47*/exfldio-305)
[ *CO*] ACPI Error: Method parse/execution failed [\_SB_.PMI0._GHL] (Node ffff883fd24de500), AE_NOT_EXIST (*N47*/psparse-536)
[ *CO*] ACPI Error: Method parse/execution failed [\_SB_.PMI0._PMC] (Node ffff883fd24de460), AE_NOT_EXIST (*N47*/psparse-536)
[ *CO*] EXT4-fs (sdl): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdi): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdi): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs (sdf): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs (sde): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdc): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdc): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs (sdd): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdh): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdh): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdg): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdg): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdj): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdj): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdb): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdb): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sda): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sda): warning: mounting fs with errors, running e2fsck is recommended
[ *CO*] EXT4-fs warning (device sdk): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs (sdk): warning: mounting fs with errors, running e2fsck is recommended
другие сбои из dmesg
[ *CO*] ACPI: \_SB_.SCK1.CP2E: failed to get CPU physical ID.
[ *CO*] ACPI: \_SB_.SCK1.CP2F: failed to get CPU physical ID.
[ *CO*] pci 0000:ff*CO*: BAR 2: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:ff*CO*: BAR 4: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:ff*CO*: BAR 2: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:ff*CO*: BAR 4: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:ff*CO*: BAR 1: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:ff*CO*: BAR 3: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:ff*CO*: BAR 5: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:ff*CO*: BAR 1: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:ff*CO*: BAR 3: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:ff*CO*: BAR 5: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:7f*CO*: BAR 2: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:7f*CO*: BAR 4: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:7f*CO*: BAR 2: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:7f*CO*: BAR 4: failed to assign [mem size 0x*N53*]
[ *CO*] pci 0000:7f*CO*: BAR 1: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:7f*CO*: BAR 3: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:7f*CO*: BAR 5: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:7f*CO*: BAR 1: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:7f*CO*: BAR 3: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:7f*CO*: BAR 5: failed to assign [mem size 0x*N54*]
[ *CO*] pci 0000:03*CO*: BAR 6: failed to assign [mem size 0x*N21* pref]
[ *CO*] ioapic: probe of 0000:00*CO* failed with error -22
[ *CO*] ioapic: probe of 0000:80*CO* failed with error -22
[ *CO*] ACPI Error: Method parse/execution failed [\_SB_.PMI0._GHL] (Node ffff883fd24de500), AE_NOT_EXIST (*N47*/psparse-536)
[ *CO*] ACPI Error: Method parse/execution failed [\_SB_.PMI0._PMC] (Node ffff883fd24de460), AE_NOT_EXIST (*N47*/psparse-536)
[ *CO*] mei_me 0000:00*CO*: initialization failed.
[ *CO*] EXT4-fs warning (device sdi): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure
[ *CO*] EXT4-fs warning (device sdc): ext4_clear_journal_err:4697: Filesystem error recorded from previous mount: IO failure