Western Digital Device Analytics Throwing Errors on Synology - Who is at fault?

video:

I have an update video coming out in which I went into the DSM source code and figured out what happened with WDDA.

I found this file which I believe to be the document used to communicate WDDA requirements from WD to Synology.

Path: /usr/syno/etc.defaults/disk-wdda/resources/WDDALocalizationResources.en

Alert ResourceID       TranlatedLanguage
DSL.NAME.0xA001 = Lifetime Power On Reset Alert
DSL.NAME.0xA002 = Power On Hours Alert
DSL.NAME.0xA003 = Head Load Lifetime Count Alert
DSL.NAME.0xA004 = Current Temperature Alert
DSL.NAME.0xA005 = Total Lifetime Workload Alert
DSL.NAME.0xA006 = Total Workload Rate Alert
DSL.NAME.0xA007 = Power On Reset Rate Alert
DSL.NAME.0xA008 = Head Load Rate Alert
DSL.NAME.0xA009 = Soft Reset Alert
DSL.NAME.0xA00A = Hard Reset Alert
DSL.NAME.0xA00B = Mechanical Failure Alert
DSL.NAME.0xA00C = Interface CRC Alert
DSL.NAME.0xA00D = Uncorrectable Errors Alert
DSL.NAME.0xA00E = Read Recovery Alert
DSL.NAME.0xA00F = Reallocated Sector Count Alert
DSL.NAME.0xA010 = Asynchronous Signal Recovery Alert
DSL.NAME.0xA011 = Low Temperature Alert
DSL.NAME.0xA012 = High Temperature Alert
DSL.DESCRIPTION.0xA001 = Powering on and off a storage device can cause wear to a storage device over time. This alert will raise an advisory when a large quantity of power on events have occurred throughout the entire life of the storage device.
DSL.DESCRIPTION.0xA002 = This alert will raise an advisory when the storage device has seen a large quantity of power on hours throughout the entire life of the storage device.
DSL.DESCRIPTION.0xA003 = A large amount of head load events can cause wear to a storage device over time. A head load event can occur when the storage device is powered on and off and when the storage device enters various power management states. This alert will raise an advisory when the storage device has seen a large quantity of head load events throughout the entire life of the storage device.
DSL.DESCRIPTION.0xA004 = Using a storage device outside its recommended operating temperature range can lead to various undesirable failures. This alert will raise an advisory when the storage device is approaching or outside the recommended operating temperature.
DSL.DESCRIPTION.0xA005 = Reading and writing to a storage device can cause wear to a storage device over time. This alert will raise an advisory when the storage device has seen a large quantity of read and writes throughout the entire life of the storage device.
DSL.DESCRIPTION.0xA006 = Reading and writing to a storage device can cause wear to a storage device over time. This alert will raise an advisory if the current read and write rate is excessively high and could potentially lead to early storage failure.
DSL.DESCRIPTION.0xA007 = Powering on and off a storage device can cause wear to a storage device over time. This alert will raise an advisory if the current power on reset rate is excessively high and could potentially lead to early storage failure.
DSL.DESCRIPTION.0xA008 = A large amount of head load events can cause wear to a storage device over time. This alert will raise an advisory if the current head load rate is excessively high and could potentially lead to early storage failure.
DSL.DESCRIPTION.0xA009 = A soft reset event occurs when one or more commands sent to the storage device has not completed and is interrupted by the host system.  This alert will raise an advisory when an excessive quantity of soft reset events have occurred within a short period of time which may indicate a potential storage device and/or system issue.
DSL.DESCRIPTION.0xA00A = A hard reset event can signal an issue between the storage device and host system.  This alert will raise an advisory when an excessive quantity of hard reset events have occurred within a short period of time which may indicate a potential storage device and/or system issue.
DSL.DESCRIPTION.0xA00B = A mechanical start failure is a failure that prevents the device from achieving a normal operating condition. This alert will raise an advisory when an excessive quantity of mechanical failures have occurred within a short period of time.
DSL.DESCRIPTION.0xA00C = An interface CRC event is an error event that can occur during data transmission between the storage device and host system.  This alert will raise an advisory when an excessive quantity of interface CRC events have occurred within a short period of time which may indicate a potential storage device and/or system issue.
DSL.DESCRIPTION.0xA00D = An uncorrectable error event occurs when the host system requests data that the storage device is unable to fully read. This alert will raise an advisory when an excessive quantity of uncorrectable error events have occurred within a short period of time which may indicate a potential storage device and/or environmental issue.
DSL.DESCRIPTION.0xA00E = An read recovery event occurs when the host system requests data and the storage device must go through multiple read recovery attempts to fully read the requested data. This alert will raise an advisory when an excessive quantity of read recovery events have occurred within a short period of time which may indicate a potential storage device and/or environmental system issue.
DSL.DESCRIPTION.0xA00F = A reallocated sector event occurs when the storage device decides a location on the media should no longer be used to store data and moves the data at that location to another location on the media. This alert will raise an advisory when an excessive quantity of reallocated sector events have occurred within a short period of time which may indicate a potential storage device and/or environmental system issue.
DSL.DESCRIPTION.0xA010 = An aysnchronous signal recovery event can indicate a possible signal connectivity issue between the storage device and host system.  This alert will raise an advisory when an excessive quantity of interface aysnchronous signal recovery events have occurred within a short period of time.
DSL.DESCRIPTION.0xA011 = Using a storage device outside its recommended operating temperature range can lead to various undesirable failures. This alert will raise an advisory if the storage device was recently used near or below the recommended operating temperature.
DSL.DESCRIPTION.0xA012 = Using a storage device outside its recommended operating temperature range can lead to various undesirable failures. This alert will raise an advisory if the storage device was recently used near or above the recommended operating temperature.
DSL.UNIT.0xA001 = Power On Resets
DSL.UNIT.0xA002 = Hours
DSL.UNIT.0xA003 = Head Loads
DSL.UNIT.0xA004 = Degrees (Celsius)
DSL.UNIT.0xA005 = Sectors
DSL.UNIT.0xA006 = TB/Year
DSL.UNIT.0xA007 = Power On Resets/Hour
DSL.UNIT.0xA008 = Head Loads/Hour
DSL.UNIT.0xA009 = Soft Resets
DSL.UNIT.0xA00A = Hard Resets
DSL.UNIT.0xA00B = Mechanical Failures
DSL.UNIT.0xA00C = Interface CRC Events
DSL.UNIT.0xA00D = Uncorrectable Error Events
DSL.UNIT.0xA00E = Read Recovery Events
DSL.UNIT.0xA00F = Reallocated Sector Events
DSL.UNIT.0xA010 = ASR Events
DSL.UNIT.0xA011 = Degrees (Celsius)
DSL.UNIT.0xA012 = Degrees (Celsius)
ACTION.DESCRIPTION.SMARTSelfTest = Run Extended S.M.A.R.T. Self-Test
ACTION.DESCRIPTION.CheckDeviceConnection = Check HDD Connection
ACTION.DESCRIPTION.LargePowerCycles = HDD has accumulated a large number of power cycles and user might consider replacing HDD soon
ACTION.DESCRIPTION.LargePowerOnHours = HDD has accumulated a large number of power on hours and user might consider replacing HDD soon
ACTION.DESCRIPTION.ReducePowerCycles = Reduce frequency of power cycles to extend the life of HDD
ACTION.DESCRIPTION.LargeHeadLoads = HDD has accumulated a large number of lifetime head loads and user might consider replacing HDD soon
ACTION.DESCRIPTION.SpinDownSoftware = Check and adjust software settings to reduce frequency of HDD spin-down to extend the life of HDD
ACTION.DESCRIPTION.PowerNASOnOff = Power System Off & On
ACTION.DESCRIPTION.Environmental = Check for environmental conditions that could affect the HDD (vibration, shock)
ACTION.DESCRIPTION.Temperature = Check for room temperature and chassis\fan issues that may be affecting temperature
ACTION.DESCRIPTION.StopTemperature = Stop using HDD until temperature has changed to recommended operating range
ACTION.DESCRIPTION.LargeDataAccess = HDD has accumulated a large number of data access and user might consider replacing HDD soon
ACTION.DESCRIPTION.Utilization = Reduce HDD data access rate to extend the life of the HDD
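
To make the dump above easier to read per alert, here is a minimal Python sketch that groups the `DSL.NAME`, `DSL.DESCRIPTION`, and `DSL.UNIT` lines by alert ID. The function name `parse_wdda_resources` and the regular expression are my own assumptions based only on the lines shown here; this is not how DSM itself reads the file.

```python
import re
from collections import defaultdict

# Matches lines such as "DSL.NAME.0xA001 = Lifetime Power On Reset Alert".
# The pattern is inferred from the dump above, not from any WD specification.
DSL_LINE = re.compile(r"^DSL\.(NAME|DESCRIPTION|UNIT)\.(0x[0-9A-Fa-f]+)\s*=\s*(.*)$")


def parse_wdda_resources(text):
    """Group DSL.* entries by alert ID; the header and ACTION.* lines are skipped."""
    alerts = defaultdict(dict)
    for line in text.splitlines():
        match = DSL_LINE.match(line.strip())
        if match:
            field, alert_id, value = match.groups()
            alerts[alert_id][field.lower()] = value.strip()
    return dict(alerts)


# Three lines copied from the dump, plus one ACTION line that should be ignored.
sample = """\
DSL.NAME.0xA002 = Power On Hours Alert
DSL.DESCRIPTION.0xA002 = This alert will raise an advisory when the storage device has seen a large quantity of power on hours throughout the entire life of the storage device.
DSL.UNIT.0xA002 = Hours
ACTION.DESCRIPTION.SMARTSelfTest = Run Extended S.M.A.R.T. Self-Test
"""

alerts = parse_wdda_resources(sample)
print(alerts["0xA002"]["name"], "/", alerts["0xA002"]["unit"])
# prints: Power On Hours Alert / Hours
```

Keying on the hex ID keeps each alert's name, description, and unit together, which makes it easier to compare one alert against the matching Synology message further down.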

The reason I believe this document actually comes from Western Digital is that I found the same document from a Japanese NAS manufacturer here:

Since two unrelated NAS vendors ship the same file, the alert definitions most likely originate with Western Digital, so it looks like Synology implemented WDDA correctly.

For those of you wondering, here is the list of warnings Synology displays for each WDDA alert:

"disk_warning_wdda_disable": "The WDDA warning status for this drive is currently disabled.",
"disk_warning_wdda_suppress": "The WDDA warning status for this drive is currently suppressed.",
"disk_wdda_async": "An excessive quantity of interface asynchronous signal recovery events in detected within a short period of time. This may indicate a possible signal connectivity issue between the drive and the system. Please make sure that the drive is properly installed in the chassis or drive tray, and that the tray is properly installed in your DiskStation, and power off and on the system. If this issue persists, please contact Synology Support Team.",
"disk_wdda_curr_temp": "Abnormal operating temperature has been detected. Please check for room temperature and the rear ventilation ports that may be affecting the temperature, or go to Control Panel > Hardware & Power > Fan Speed Mode to change the fan operation setting. If this issue persists, please stop using drive until temperature has changed to recommended operating range, or contact Synology Support Team.",
"disk_wdda_description": "Western Digital Device Analytics (WDDA) provides operational parameters to help you further manage the WD Purple, Red Plus, and Red Pro HDDs. Once WDDA is enabled, the system will collect the data on your drive every hour and notify you when potential issues are detected. To enable WDDA, please go to Storage Manager > HDD/SSD > Settings.",
"disk_wdda_enable": "Enable the WDDA service",
"disk_wdda_enable_desc": "With the Western Digital Device Analytics (WDDA) service enabled, all the drives supporting this service will be tested regularly.",
"disk_wdda_full_title": "Western Digital Device Analytics",
"disk_wdda_hard_reset": "An excessive quantity of hard reset event is detected within a short period of time. This may indicate a potential drive and/or system issue. Please make sure that the drive is properly installed in the chassis or drive tray, and that the tray is properly installed in your DiskStation, and run the Extended S.M.A.R.T. Self-Test. If this issue persists, please contact Western Digital support team.",
"disk_wdda_head_load": "The drive has accumulated a large number of lifetime head loads. This may cause a wear down to the drive over time and result in early storage failure. Please consider to replace the drive soon. For more information, please contact Western Digital support team.",
"disk_wdda_head_load_r": "An excessively high head load rate is detected. This may cause a wear down to the drive over time and result in early storage failure. Please check and adjust the software settings to reduce the frequency of the drive spin-down and the power cycles to extend the life of the drive. For more information, please contact Western Digital support team.",
"disk_wdda_high_temp": "Abnormally high operating temperature has been detected. Please make sure that the rear ventilation ports are not blocked, and please try to lower the ambient temperature. If the temperature is still high, please go to Control Panel > Hardware & Power > Fan Speed Mode to change the fan operation to a higher speed. If this issue persists, please contact Synology Support Team.",
"disk_wdda_icrc": "An excessive quantity of interface CRC event is detected during data transmission within a short period of time. This may indicate a potential drive and/or system issue. Please make sure that the drive is properly installed in the chassis or drive tray, and that the tray is properly installed in your DiskStation, and power off and on the system. If this issue persists, please contact Synology Support Team.",
"disk_wdda_low_temp": "Abnormally low operating temperature has been detected. Please try to increase the ambient temperature or go to Control Panel > Hardware & Power > Fan Speed Mode to change the fan operation to a lower speed. If this issue persists, please contact Synology Support Team.",
"disk_wdda_mech_fail": "An excessive quantity of mechanical failures are detected within a short period of time. This may prevent the drive from achieving its normal operating condition. Please run the Extended S.M.A.R.T. Self-Test. If this issue persists, please contact Western Digital support team.",
"disk_wdda_power_hours": "The drive has accumulated a large number of power on hours throughout the entire life of the drive. Please consider to replace the drive soon. For more information, please contact Western Digital support team.",
"disk_wdda_power_reset": "The drive has accumulated a large quantity of power on events. Powering on and off frequently for the drive can cause a wear down to the drive over time. Please consider to replace the drive soon. For more information, please contact Western Digital support team.",
"disk_wdda_power_reset_r": "An excessive quantity of power on events is detected. Powering on and off frequently for the drive can cause a wear down to the drive over time. If the current power on reset rate is excessively high, it could potentially lead to an early storage failure. Please reduce the frequency of power cycles to extend the life of the drive. For more information, please contact Western Digital support team.",
"disk_wdda_read_recovery": "please contact Western Digital support team.An excessive quantity of read recovery events in detected within a short period of time. This occurs when the host system requests data and the drive must go through multiple read recovery attempts to fully read the requested data. This may indicate a potential drive and/or environmental system issue. Please make sure that the drive is properly installed in the chassis or drive tray, and that the tray is properly installed in your DiskStation, and run the Extended S.M.A.R.T. Self-Test. If this issue persists, please contact Western Digital support team.",
"disk_wdda_reallocate": "An excessive quantity of reallocated sector event is detected within a short period of time. This occurs when the drive decides a location on the media should no longer be used to store data and moves the data at that location to another location on the media. This may indicate a potential drive and/or environmental system issue. Please run the Extended S.M.A.R.T. Self-Test. If this issue persists, please contact Western Digital support team.",
"disk_wdda_setting_note": "Only <a href=\"{0}\" target=\"_blank\" class=\"link-font\">certain drive models</a> support WDDA.",
"disk_wdda_setting_title": "Western Digital Device Analytics (WDDA)",
"disk_wdda_soft_reset": "An excessive quantity of soft reset event is detected within a short period of time. This may indicate that one or more commands sent to the drive is not completed and is interrupted by the host system. This may indicate a potential drive and/or system issue. Please make sure that the drive is properly installed in the chassis or drive tray, and that the tray is properly installed in your DiskStation, and run the Extended S.M.A.R.T. Self-Test. If this issue persists, please contact Western Digital support team.",
"disk_wdda_title": "WDDA",
"disk_wdda_unc": "An excessive quantity of uncorrectable error event is detected within a short period of time, due to the host system requests data that the drive is unable to fully read. This may indicate a potential drive and/or environmental issue. Please make sure your drive and DiskStation are placed on a stable surface, and run the Extended S.M.A.R.T. Self-Test. If this issue persists, please contact Western Digital support team.",
"disk_wdda_work_load": "The drive has accumulated a large number of data access. Excessive reading and writing to a drive can cause a wear down to the drive over time. Please consider to replace the drive soon. For more information, please contact Western Digital support team.",
"disk_wdda_work_load_r": "An excessively high read and write rate is detected. This may cause a wear down to the drive over time and result in early storage failure. Please reduce the drive data access rate to extend the life of the drive. For more information, please contact Western Digital support team.",