From mboxrd@z Thu Jan  1 00:00:00 1970
From: IT Superhack <itsuperhack@web.de>
To: development@lists.ipfire.org
Subject:
 Re: Lots of kernel errors on /dev/sda right after upgrading to Core104
Date: Fri, 23 Sep 2016 19:31:00 +0000
Message-ID: <4e495af9-c67f-430c-a903-15f2f5a99dbd@web.de>
In-Reply-To: <fa3b2733-54bc-a4f5-1138-1793dfeb4836@ipfire.org>
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary="===============3964013312388717878=="
List-Id: <development.lists.ipfire.org>

--===============3964013312388717878==
Content-Type: text/plain; charset="utf-8"
Content-Transfer-Encoding: quoted-printable

Hello Matthias,

There are some complaints about kernel errors in this forum thread:
http://forum.ipfire.org/viewtopic.php?f=3D17&t=3D17195&p=3D100655

Some of them look quite similar to those you encountered.

I cannot reproduce this on my own IPFire machine (an ARM board with a
32 GB SD card), so the error may only occur when a "real" HDD is in use.
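
In case it helps to narrow this down, the CDB lines in your log (quoted
below) can be decoded by hand. Here is a minimal Python sketch, assuming
the usual 16-byte ATA PASS-THROUGH (16) layout with the ATA command in
byte 14. The function names are made up for illustration and the table
only covers the two opcodes from your log:

```python
# Sketch only: helper names are hypothetical, the table is incomplete.

def known_commands():
    # ATA opcodes seen in the quoted log; 0x98 is the old alias of 0xE5.
    return {
        0xE5: "CHECK POWER MODE",
        0x98: "CHECK POWER MODE (old opcode)",
    }

def ata_command(cdb_hex):
    # Assumes an ATA PASS-THROUGH (16) CDB: opcode 0x85 in byte 0 and
    # the ATA command register in byte 14.
    return known_commands().get(bytes.fromhex(cdb_hex)[14], "unknown")

print(ata_command("85 06 20 00 00 00 00 00 00 00 00 00 00 40 e5 00"))
print(ata_command("85 06 20 00 00 00 00 00 00 00 00 00 00 40 98 00"))
```

Both CDBs come out as CHECK POWER MODE, an ATA command. My guess (not
verified) is that your disk, which smartctl reports with a Fibre Channel
transport, does not accept ATA pass-through at all, so it may be worth
checking which tool sends these requests.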

Best regards,
Timmothy Wilson

Matthias Fischer:
> Hi list,
>=20
> I'm experiencing an interesting problem here:
>=20
> *Exactly* after rebooting after upgrading to Core 104, which worked
> without any problems, I find that I got a lot of kernel errors:
>=20
> ...
> 21:50:22	kernel: 	REJECT_FORWARD IN=3Dgreen0 OUT=3Dred0 MAC=3D00:02:b3:1a:b=
0:96:00:07:e9:40:b8:29:08:00 SRC=3D192.168.100.1 DST=3D192.168.100.254 LEN=3D=
52 TOS=3D0x00 PREC=3D0x00 TTL=3D127 ID=3D17075 DF PROTO=3DTCP SPT=3D50633 DPT=
=3D444 WINDOW=3D8192 RES=3D0x00 SYN URGP=3D0
> 21:50:32	kernel: 	Kernel logging (proc) stopped.
> 21:50:32	kernel: 	Kernel log daemon terminating.
> 21:52:40	kernel: 	IPv6: ADDRCONF(NETDEV_UP): green0: link is not ready
> 21:52:40	kernel: 	e100 0000:00:03.0 green0: NIC Link is Up 100 Mbps Full Du=
plex
> 21:52:41	kernel: 	IPv6: ADDRCONF(NETDEV_CHANGE): green0: link becomes ready
> 21:52:41	kernel: 	IPv6: ADDRCONF(NETDEV_UP): red0: link is not ready
> 21:52:41	kernel: 	e100 0000:00:06.0 red0: NIC Link is Up 100 Mbps Full Dupl=
ex
> 21:52:41	kernel: 	IPv6: ADDRCONF(NETDEV_CHANGE): red0: link becomes ready
> 21:54:01	kernel: 	perf samples too long (2501 > 2500), lowering kernel.perf=
_event_max_sample_rate to 50100
> 21:55:01	kernel: 	sd 1:0:0:0: [sda] Unhandled error code
> 21:55:01	kernel: 	sd 1:0:0:0: [sda]
> 21:55:01	kernel: 	Result: hostbyte=3DDID_ABORT driverbyte=3DDRIVER_OK
> 21:55:01	kernel: 	sd 1:0:0:0: [sda] CDB:
> 21:55:01	kernel: 	ATA command pass through(16): 85 06 20 00 00 00 00 00 00 =
00 00 00 00 40 e5 00
> 21:55:01	kernel: 	sd 1:0:0:0: [sda] Unhandled error code
> 21:55:01	kernel: 	sd 1:0:0:0: [sda]
> 21:55:01	kernel: 	Result: hostbyte=3DDID_ABORT driverbyte=3DDRIVER_OK
> 21:55:01	kernel: 	sd 1:0:0:0: [sda] CDB:
> 21:55:01	kernel: 	ATA command pass through(16): 85 06 20 00 00 00 00 00 00 =
00 00 00 00 40 98 00
> ...
>=20
> Profile:
> http://fireinfo.ipfire.org/profile/63d7b5d45f8a7816ca68810ed0061d7ff95a9958
>=20
> Nothing was changed, I tested the drive including cables =3D> no problems:
>=20
> ***SNIP***
> root@ipfire: / # smartctl -a /dev/sda
> smartctl 6.5 2016-05-07 r4318 [i686-linux-3.14.79-ipfire-pae] (IPFire 2.19)
> Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org
>=20
> =3D=3D=3D START OF INFORMATION SECTION =3D=3D=3D
> Vendor:               IBM
> Product:              DDYS-T36950M
> Revision:             S96H
> User Capacity:        36,703,918,080 bytes [36.7 GB]
> Logical block size:   512 bytes
> Rotation Rate:        10000 rpm
> Logical Unit id:      0x5005076707c1df6e
> Serial number:        4FY3P734
> Device type:          disk
> Transport protocol:   Fibre channel (FCP-2)
> Local Time is:        Fri Sep 23 18:46:45 2016 CEST
> SMART support is:     Available - device has SMART capability.
> SMART support is:     Enabled
> Temperature Warning:  Enabled
>=20
> =3D=3D=3D START OF READ SMART DATA SECTION =3D=3D=3D
> SMART Health Status: OK
>=20
> Current Drive Temperature:     42 C
> Drive Trip Temperature:        85 C
>=20
> Manufactured in week 46 of year 2001
> Specified cycle count over device lifetime:  10000
> Accumulated start-stop cycles:  139
> Elements in grown defect list: 2
>=20
> Error counter log:
>            Errors Corrected by           Total   Correction     Gigabytes  =
  Total
>                ECC          rereads/    errors   algorithm      processed  =
  uncorrected
>            fast | delayed   rewrites  corrected  invocations   [10^9 bytes]=
  errors
> read:          0        0         0         3          3       2490.529    =
       0
> write:         0        2         0        16         16       1534.193    =
       0
> verify:        0        0         0         0          0         73.408    =
       0
>=20
> Non-medium error count:        0
> ***SNAP***
>=20
> What puzzles me is the fact that these errors first *occur right after rebo=
oting with Core104*,
> but *never* before, all hardware is running and seems to be ok. No Panic, n=
o crash, nothing,
> just these kernel errors since upgrading.
>=20
> Does anyone have an idea?
>=20
> Best,
> Matthias
>=20



--===============3964013312388717878==--