DS918+ "Zombie"


Sethy

Hello everyone,

For about 6 months I have been having problems with my DS918+ (DSM 6.2.4-25556 Update 5, the same as my two other NAS units). I have not changed anything fundamental on the machine for a good year.

After a reboot, the machine works perfectly well for a few days or a few weeks. Then, with no explanation, it is no longer possible to connect to it: not through the web interface, not through the terminal, and not through the Synology configuration tool either, which simply no longer finds the machine.

However, the machine still responds to ping and Cloud Station Drive is still active, since data keeps being replicated to several PCs. But replications from the other Synology units do not work.
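
To pin down which services are actually dead while ping still answers, a small TCP probe run from a PC on the same network can help. This is only a sketch: the port numbers are the usual DSM defaults (22 for SSH, 5000 and 5001 for the web interface) and are an assumption here, so adjust them if they were changed on the NAS.

```python
import socket

# Assumed DSM default ports; change them if the NAS uses custom ones.
DEFAULT_PORTS = {"ssh": 22, "dsm-http": 5000, "dsm-https": 5001}


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


def probe(host, ports=DEFAULT_PORTS, timeout=3.0):
    """Return a {service name: reachable?} mapping for the given ports."""
    return {name: port_is_open(host, port, timeout) for name, port in ports.items()}
```

Calling `probe("192.168.1.10")` (a hypothetical address) while the NAS is in its "zombie" state would show whether the ports refuse connections or simply never answer, which is a useful detail to post alongside the ping result.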

I have no choice but to do a hard reboot each time, and I have the impression that the time between two failures is tending to shrink.

The Synology website suggests a memory test, but it detected no errors.

Does anyone have an idea of what is causing the problem?

 

Thanks in advance,

Sethy


On the disk side, no, nothing to report. I removed the SSD cache 4 years ago.

As for the time-out, can you be more specific? I open the "web" connection and, indeed, it spins endlessly until the browser shows an error message.

In the logs, I see absolutely nothing unusual.

 


  • 2 weeks later...

Hello, is the fan spinning properly? (possible overheating)
Besides checking the SMART values, you can run the extended SMART tests and also try replacing the battery (CR2032).
https://fr.ifixit.com/Tutoriel/Synology+DS218++-+Démontage+complet/112475

Edited by jacaj

On 19/04/2022 at 12:37, Einsteinium said:

Post the SMART values of your hard drives; retrieve them over ssh, which avoids the web interface.

I get a time-out on the ssh connection, but since it does not respond to ping, that seems logical to me.

After a hard reboot, here are the SMART results for the 4 disks (sudo smartctl -d ata /dev/sda -a):

smartctl 6.5 (build date Mar  2 2021) [x86_64-linux-4.4.59+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68N32N0
Serial Number:    WD-WCC7K1US813T
LU WWN Device Id: 5 0014ee 265091edc
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Apr 20 19:46:33 2022 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (32640) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 347) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x303d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME                                                   FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate                                              0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time                                                     0x0027   164   163   021    Pre-fail  Always       -       6758
  4 Start_Stop_Count                                                 0x0032   100   100   000    Old_age   Always       -       27
  5 Reallocated_Sector_Ct                                            0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate                                                  0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours                                                   0x0032   064   064   000    Old_age   Always       -       26361
 10 Spin_Retry_Count                                                 0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count                                          0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count                                                0x0032   100   100   000    Old_age   Always       -       27
192 Power-Off_Retract_Count                                          0x0032   200   200   000    Old_age   Always       -       6
193 Load_Cycle_Count                                                 0x0032   200   200   000    Old_age   Always       -       30
194 Temperature_Celsius                                              0x0022   119   111   000    Old_age   Always       -       31
196 Reallocated_Event_Count                                          0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector                                           0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable                                            0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count                                             0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate                                            0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     26360         -
# 2  Short offline       Completed without error       00%     23932         -
# 3  Short offline       Completed without error       00%     23213         -
# 4  Short offline       Completed without error       00%     22470         -
# 5  Short offline       Completed without error       00%     21726         -
# 6  Short offline       Completed without error       00%     21007         -
# 7  Short offline       Completed without error       00%     20264         -
# 8  Short offline       Completed without error       00%     19545         -
# 9  Short offline       Completed without error       00%     18802         -
#10  Short offline       Completed without error       00%     18131         -
#11  Short offline       Completed without error       00%     17388         -
#12  Short offline       Completed without error       00%     16645         -
#13  Short offline       Completed without error       00%     15925         -
#14  Short offline       Completed without error       00%     15181         -
#15  Short offline       Completed without error       00%     14465         -
#16  Short offline       Completed without error       00%     13722         -
#17  Short offline       Completed without error       00%     12979         -
#18  Short offline       Completed without error       00%     12260         -
#19  Short offline       Completed without error       00%     11516         -
#20  Short offline       Completed without error       00%     10797         -
#21  Short offline       Completed without error       00%     10055         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
smartctl 6.5 (build date Mar  2 2021) [x86_64-linux-4.4.59+] (local build)
Copyright (C) 2002-16, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    WD-WCC4N1LEPT9D
LU WWN Device Id: 5 0014ee 2b9581f7a
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Apr 20 19:49:41 2022 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (41460) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 416) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x703d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME                                                   FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate                                              0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time                                                     0x0027   184   183   021    Pre-fail  Always       -       5800
  4 Start_Stop_Count                                                 0x0032   100   100   000    Old_age   Always       -       44
  5 Reallocated_Sector_Ct                                            0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate                                                  0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours                                                   0x0032   050   050   000    Old_age   Always       -       36515
 10 Spin_Retry_Count                                                 0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count                                          0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count                                                0x0032   100   100   000    Old_age   Always       -       44
192 Power-Off_Retract_Count                                          0x0032   200   200   000    Old_age   Always       -       7
193 Load_Cycle_Count                                                 0x0032   200   200   000    Old_age   Always       -       416
194 Temperature_Celsius                                              0x0022   119   110   000    Old_age   Always       -       31
196 Reallocated_Event_Count                                          0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector                                           0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable                                            0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count                                             0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate                                            0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     36514         -
# 2  Short offline       Completed without error       00%     34086         -
# 3  Short offline       Completed without error       00%     33367         -
# 4  Short offline       Completed without error       00%     32624         -
# 5  Short offline       Completed without error       00%     31880         -
# 6  Short offline       Completed without error       00%     31161         -
# 7  Short offline       Completed without error       00%     30418         -
# 8  Short offline       Completed without error       00%     29699         -
# 9  Short offline       Completed without error       00%     28956         -
#10  Short offline       Completed without error       00%     28285         -
#11  Short offline       Completed without error       00%     27542         -
#12  Short offline       Completed without error       00%     26798         -
#13  Short offline       Completed without error       00%     26079         -
#14  Short offline       Completed without error       00%     25335         -
#15  Short offline       Completed without error       00%     24619         -
#16  Short offline       Completed without error       00%     23876         -
#17  Short offline       Completed without error       00%     23133         -
#18  Short offline       Completed without error       00%     22413         -
#19  Short offline       Completed without error       00%     21670         -
#20  Short offline       Completed without error       00%     20951         -
#21  Short offline       Completed without error       00%     20209         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68N32N0
Serial Number:    WD-WCC7K5XZJ0UX
LU WWN Device Id: 5 0014ee 2650926e9
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Form Factor:      3.5 inches
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Apr 20 19:50:16 2022 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (33120) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 352) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x303d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME                                                   FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate                                              0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time                                                     0x0027   167   167   021    Pre-fail  Always       -       6641
  4 Start_Stop_Count                                                 0x0032   100   100   000    Old_age   Always       -       20
  5 Reallocated_Sector_Ct                                            0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate                                                  0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours                                                   0x0032   070   070   000    Old_age   Always       -       22024
 10 Spin_Retry_Count                                                 0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count                                          0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count                                                0x0032   100   100   000    Old_age   Always       -       20
192 Power-Off_Retract_Count                                          0x0032   200   200   000    Old_age   Always       -       7
193 Load_Cycle_Count                                                 0x0032   200   200   000    Old_age   Always       -       21
194 Temperature_Celsius                                              0x0022   118   111   000    Old_age   Always       -       32
196 Reallocated_Event_Count                                          0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector                                           0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable                                            0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count                                             0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate                                            0x0008   100   253   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     22024         -
# 2  Short offline       Completed without error       00%     19596         -
# 3  Short offline       Completed without error       00%     18877         -
# 4  Short offline       Completed without error       00%     18133         -
# 5  Short offline       Completed without error       00%     17390         -
# 6  Short offline       Completed without error       00%     16671         -
# 7  Short offline       Completed without error       00%     15928         -
# 8  Short offline       Completed without error       00%     15209         -
# 9  Short offline       Completed without error       00%     14466         -
#10  Short offline       Completed without error       00%     13795         -
#11  Short offline       Completed without error       00%     13052         -
#12  Short offline       Completed without error       00%     12309         -
#13  Short offline       Completed without error       00%     11589         -
#14  Short offline       Completed without error       00%     10845         -
#15  Short offline       Completed without error       00%     10129         -
#16  Short offline       Completed without error       00%      9386         -
#17  Short offline       Completed without error       00%      8643         -
#18  Short offline       Completed without error       00%      7924         -
#19  Short offline       Completed without error       00%      7180         -
#20  Short offline       Completed without error       00%      6461         -
#21  Short offline       Completed without error       00%      5719         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.
=== START OF INFORMATION SECTION ===
Model Family:     Western Digital Red
Device Model:     WDC WD30EFRX-68EUZN0
Serial Number:    WD-WCC4N3ZX942H
LU WWN Device Id: 5 0014ee 2b957e4bf
Firmware Version: 82.00A82
User Capacity:    3,000,592,982,016 bytes [3.00 TB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    5400 rpm
Device is:        In smartctl database [for details use: -P show]
ATA Version is:   ACS-2 (minor revision not indicated)
SATA Version is:  SATA 3.0, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Wed Apr 20 19:51:53 2022 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

=== START OF READ SMART DATA SECTION ===
SMART overall-health self-assessment test result: PASSED

General SMART Values:
Offline data collection status:  (0x00) Offline data collection activity
                                        was never started.
                                        Auto Offline Data Collection: Disabled.
Self-test execution status:      (   0) The previous self-test routine completed
                                        without error or no self-test has ever
                                        been run.
Total time to complete Offline
data collection:                (38160) seconds.
Offline data collection
capabilities:                    (0x7b) SMART execute Offline immediate.
                                        Auto Offline data collection on/off support.
                                        Suspend Offline collection upon new
                                        command.
                                        Offline surface scan supported.
                                        Self-test supported.
                                        Conveyance Self-test supported.
                                        Selective Self-test supported.
SMART capabilities:            (0x0003) Saves SMART data before entering
                                        power-saving mode.
                                        Supports SMART auto save timer.
Error logging capability:        (0x01) Error logging supported.
                                        General Purpose Logging supported.
Short self-test routine
recommended polling time:        (   2) minutes.
Extended self-test routine
recommended polling time:        ( 383) minutes.
Conveyance self-test routine
recommended polling time:        (   5) minutes.
SCT capabilities:              (0x703d) SCT Status supported.
                                        SCT Error Recovery Control supported.
                                        SCT Feature Control supported.
                                        SCT Data Table supported.

SMART Attributes Data Structure revision number: 16
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME                                                   FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate                                              0x002f   200   200   051    Pre-fail  Always       -       0
  3 Spin_Up_Time                                                     0x0027   181   180   021    Pre-fail  Always       -       5925
  4 Start_Stop_Count                                                 0x0032   100   100   000    Old_age   Always       -       44
  5 Reallocated_Sector_Ct                                            0x0033   200   200   140    Pre-fail  Always       -       0
  7 Seek_Error_Rate                                                  0x002e   200   200   000    Old_age   Always       -       0
  9 Power_On_Hours                                                   0x0032   050   050   000    Old_age   Always       -       36515
 10 Spin_Retry_Count                                                 0x0032   100   253   000    Old_age   Always       -       0
 11 Calibration_Retry_Count                                          0x0032   100   253   000    Old_age   Always       -       0
 12 Power_Cycle_Count                                                0x0032   100   100   000    Old_age   Always       -       44
192 Power-Off_Retract_Count                                          0x0032   200   200   000    Old_age   Always       -       7
193 Load_Cycle_Count                                                 0x0032   200   200   000    Old_age   Always       -       410
194 Temperature_Celsius                                              0x0022   119   111   000    Old_age   Always       -       31
196 Reallocated_Event_Count                                          0x0032   200   200   000    Old_age   Always       -       0
197 Current_Pending_Sector                                           0x0032   200   200   000    Old_age   Always       -       0
198 Offline_Uncorrectable                                            0x0030   100   253   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count                                             0x0032   200   200   000    Old_age   Always       -       0
200 Multi_Zone_Error_Rate                                            0x0008   200   200   000    Old_age   Offline      -       0

SMART Error Log Version: 1
No Errors Logged

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Short offline       Completed without error       00%     36515         -
# 2  Short offline       Completed without error       00%     34086         -
# 3  Short offline       Completed without error       00%     33367         -
# 4  Short offline       Completed without error       00%     32624         -
# 5  Short offline       Completed without error       00%     31881         -
# 6  Short offline       Completed without error       00%     31161         -
# 7  Short offline       Completed without error       00%     30418         -
# 8  Short offline       Completed without error       00%     29699         -
# 9  Short offline       Completed without error       00%     28957         -
#10  Short offline       Completed without error       00%     28285         -
#11  Short offline       Completed without error       00%     27542         -
#12  Short offline       Completed without error       00%     26799         -
#13  Short offline       Completed without error       00%     26079         -
#14  Short offline       Completed without error       00%     25335         -
#15  Short offline       Completed without error       00%     24619         -
#16  Short offline       Completed without error       00%     23876         -
#17  Short offline       Completed without error       00%     23133         -
#18  Short offline       Completed without error       00%     22414         -
#19  Short offline       Completed without error       00%     21670         -
#20  Short offline       Completed without error       00%     20951         -
#21  Short offline       Completed without error       00%     20209         -

SMART Selective self-test log data structure revision number 1
 SPAN  MIN_LBA  MAX_LBA  CURRENT_TEST_STATUS
    1        0        0  Not_testing
    2        0        0  Not_testing
    3        0        0  Not_testing
    4        0        0  Not_testing
    5        0        0  Not_testing
Selective self-test flags (0x0):
  After scanning selected spans, do NOT read-scan remainder of disk.
If Selective self-test is pending on power-up, resume after 0 minute delay.

(And I haven't said it yet, but thanks for giving me a hand.)

Edited by Sethy

The values are clean, you've already tested the memory, and there's no cache... so I'd say that, if you're lucky, it's a software issue. Do you have any third-party packages running?

Enable the usage history in Resource Monitor if it isn't already on, and look at the CPU/RAM curves leading up to the problem; you may have some piece of junk saturating the server (a crypto miner or something similar).

After that, if you can't find the cause, you'll have to consider restoring the system.
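
If the Resource Monitor history turns out to be too coarse, a small logger is one way to see what the machine looked like just before it froze. This is only a sketch, assuming Python 3 is available and that the script runs on the NAS itself (or on any Linux box), reading the standard `/proc/loadavg` and `/proc/meminfo` files. The file name `resource_log.csv`, the 60-second interval, and the function names are illustrative choices, not anything provided by DSM.

```python
import csv
import time
from datetime import datetime


def parse_meminfo(text):
    """Return /proc/meminfo fields as a dict of integers (kB for sized fields)."""
    fields = {}
    for line in text.splitlines():
        name, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            fields[name.strip()] = int(parts[0])
    return fields


def parse_loadavg(text):
    """Return the 1-, 5- and 15-minute load averages as floats."""
    one, five, fifteen = text.split()[:3]
    return float(one), float(five), float(fifteen)


def sample():
    """Read one snapshot of load and memory from the Linux /proc filesystem."""
    with open("/proc/loadavg") as f:
        load = parse_loadavg(f.read())
    with open("/proc/meminfo") as f:
        mem = parse_meminfo(f.read())
    available = mem.get("MemAvailable", mem.get("MemFree", 0))
    stamp = datetime.now().isoformat(timespec="seconds")
    return [stamp, *load, available, mem.get("SwapFree", 0)]


def run(path="resource_log.csv", interval=60):
    """Append one row per interval, flushing so the last row survives a freeze."""
    with open(path, "a", newline="") as f:
        writer = csv.writer(f)
        while True:
            writer.writerow(sample())
            f.flush()
            time.sleep(interval)
```

Calling `run()` starts the loop. After the next hang, the last rows of the CSV show whether load or available memory drifted before the machine became unreachable, or whether everything stayed flat until the end.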

@jacaj: I ran the extended SMART test and all 4 disks are Healthy.

As for resources, all the graphs look alike (flat, right up until the machine goes off).

image.png.f98f5b2c52e9d54dcf29c87bc80de350.png

The interruption on March 30 or April 2 corresponds to the RAM test. As you can see, on April 10 the memory usage drops "as if" the machine had been switched off.

Except that:
1/ the last events date from April 8 at 1 a.m.
2/ Cloud Station Drive was still working on April 17!

For information, volume 4 is 94% full (but there are still 400 GB left):

image.png.52e54684dcdee08e0e7921a516b854cd.png

Here is the log:

image.png.4921e4ceeeb114a7654825c8f1a348c3.png

Oops, I hadn't seen your reply.

So, that graph is really odd if your NAS was actually running... and what does the swap memory look like on the same scale? In my opinion we'll see the inverse curve, with swap compensating; this does look like a memory problem... open a ticket with support (see the end of my signature).

Also, for your information, a volume that drops below 3% free space means a system that will suffer slowdowns.
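
To put numbers on that, here is a quick back-of-the-envelope check using the figures quoted earlier in the thread (94% full, 400 GB remaining). It is a rough sketch: the 94% is a rounded value, so the derived total is approximate, and the 3% figure is read here as a free-space threshold, which is an assumption.

```python
def volume_stats(used_fraction, free_gb):
    """Derive total size and free fraction from the used fraction and free space."""
    free_fraction = 1.0 - used_fraction
    total_gb = free_gb / free_fraction
    return total_gb, free_fraction


total_gb, free_fraction = volume_stats(used_fraction=0.94, free_gb=400)
threshold = 0.03  # the 3% figure mentioned above, assumed to mean free space

print(f"estimated total: {total_gb:.0f} GB, free: {free_fraction:.0%}")
print("below threshold" if free_fraction < threshold else "above threshold")
```

With these inputs the volume comes out at roughly 6.7 TB with about 6% free, so it is still above the 3% mark, though not by a wide margin.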

I'm the only user of the NAS and, on top of that, it's the backup NAS. So in principle it "accepts" snapshots (meaning only the blocks that changed on the master), the "full backups", etc. The only package that hasn't yet been migrated from this NAS to the master is Cloud Station.

So I'm not surprised by the memory usage. Note that I upgraded this NAS to 12 GB.

Here is the swap:

image.thumb.png.fd93ed300a045e723786e9bf2b27535a.png

The config:

image.png.746e70431f10eb99e8d0044f2fd7a654.png

Edited by Sethy

Swap is normal...

On 21/04/2022 at 11:04, Sethy said:

As you can see, on April 10 the memory usage drops "as if" the machine had been switched off.

So not a hardware problem, but a software one:

- Either you contact support and wait

- Or you disable all the packages and leave the NAS running to see whether it comes from one of them or from DSM
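
While the NAS is left running with packages disabled, it can help to record exactly when the web interface and SSH stop answering, since the machine's own log goes quiet at that point. The following is a minimal sketch meant to run from another machine on the network. The port numbers are the usual defaults (22 for SSH, 5000/5001 for the DSM web interface) and should be adjusted if they were changed; the function names are illustrative.

```python
import socket
from datetime import datetime

# Assumed defaults: 22 = SSH, 5000/5001 = DSM web interface (HTTP/HTTPS).
PORTS = {"ssh": 22, "dsm-http": 5000, "dsm-https": 5001}


def port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def probe(host, ports=PORTS):
    """Return a dict mapping each service name to whether its port answered."""
    return {name: port_open(host, port) for name, port in ports.items()}


def format_line(results):
    """Build one timestamped log line, e.g. '<time> ssh=up dsm-http=down'."""
    stamp = datetime.now().isoformat(timespec="seconds")
    states = " ".join(
        f"{name}={'up' if ok else 'down'}" for name, ok in results.items()
    )
    return f"{stamp} {states}"
```

For example, `print(format_line(probe("192.168.1.50")))` (a made-up address) run every few minutes from a scheduled task and redirected to a file gives a timeline. The first line where the services flip to `down` while ping still answers narrows down when the hang started, which can then be compared with the package that was re-enabled last.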
