As the number and variety of battery-powered products increases, the marketplace becomes an attractive target for design and manufacturing companies diversifying their product portfolios. With this diversification comes quick time-to-market demands and engineers inexperienced with the new applications. This, in turn, increases pressure on the system designer to “just get the job done.”
In response to these conditions, the IEEE 1625 standard was developed by manufacturers of Li-ion/Li-ion polymer cells, battery packs, battery and power management semiconductors, and portable computing systems. The standard aims to offer design methodologies to facilitate development of reliable and safe battery-powered systems that still provide the desired features and functions.
The design methodologies described in the standard are based on lessons learned throughout the industry. As a result, IEEE 1625 can help newcomers to the Li-ion and Li-ion polymer battery-powered applications ensure the safety and reliability of their designs even when the speed of design is paramount.
What is IEEE 1625?
The standard covers design approaches that ensure reliable operation and that minimize the occurrence of faults leading to hazards in portable computing devices and other rechargeable battery-operated systems. From the beginning of the design, it is necessary to examine all facets of the portable design to ensure the reliability of the entire system, as well as that of its individual components, functions or subsystems.
The standard guides the system and subsystem designers through five major areas — system integration, cell, pack, host device and total system reliability. Also covered are the critical operational parameters and how they change with time and environment, the effects of extremes in temperature, and the management of component failure.
Gaining overall compliance requires conformance with each and every subsection of the standard. A portable computing device cannot achieve compliance without consideration of all the related subsystems — including the user (Fig. 1). To achieve compliance, designers of each subsystem must thoroughly review their designs both individually and in conjunction with other subsystems to identify faults that could propagate hazards. Once it has been ascertained that the subsystems all conform to their particular standard requirements, a further analysis is needed to assess the overall system compliance to ensure the design does not allow two faults of any type to propagate a hazard.
Applying IEEE 1625 to Battery Design
Although the designers of the overall system and its various subsystems are affected by this standard, here we focus on the designer of the battery to highlight where the standard is used and the benefits it can bring to safety and reliability of the overall system. A Design Failure Modes and Effects Analysis (DFMEA) approach is a typical example of an industry-wide methodology that can be used to highlight and prioritize the possible roots of faults and hazards.
The architecture and component selection of the battery design can be driven by factors other than the results of the DFMEA. Such factors include price, availability and size. This situation forces the DFMEA to be a living document that changes as the design unfolds.
Consider the design of an SBS1.1-compliant, 4-series by 1-parallel Li-ion battery. The architecture chosen is a new one, involving new ideas that could reduce size, components and overall cost. The architecture change makes close scrutiny of the design analysis more important because the ideas are new to all engineers involved in the design, which is where the standard provides the greatest benefit. Fig. 2 shows the change in architecture and helps clarify the DFMEA analysis.
The existing architecture has the battery management unit (BMU), first-level protection and second-level protection elements monitoring the cell stack to ensure a double level of coverage if one protection element fails. However, the new architecture has the BMU and analog front end (AFE) monitoring each other as well as the cell stack. Theoretically, this allows the second-level protection to be integrated with the first-level protection device.
The new architecture primarily consists of a BMU, an AFE integrated circuit (IC), current-sense resistor, safety FETs and a chemical fuse (Fig. 3). The BMU consists of a 3.3-V microcontroller with high-performance measurement capabilities and field-programmable flash. The AFE provides a high-voltage interface to the cell stack for voltage measurement along with cell balancing control, overcurrent protection and a low dropout regulator (LDO) to power the BMU.
Other components are needed to enable the necessary features and reach the level of reliability required. The pack and cell sections of the standard are directly relevant in this example analysis, although these sections, in turn, can be divided into subsections to aid design focus. The DFMEA tables show the possible fault modes of key discrete components in the battery electronics (Table 1) and BMU and AFE interaction issues (Table 2).
ICs and discrete components are an important focus for the overall design. The selection of suitably rated components is not covered in this example, but the DFMEA should be extended to include the possible failure modes of the different components including the printed circuit board (PCB) traces, vias and connection points. The design analysis also should be extended beyond this example to include at least second-level faults — that is, two faults occurring independently — to be IEEE-compliant and even a third level if the fault is deemed severe.
In the tables, each item is evaluated and scored for the following criteria:
- Severity of Fault (SEV)
- Probability of Occurrence (OCC)
- Difficulty of Detection (DET).
Typically, evaluation and scoring is based on experience and statistical reliability data. However, this data is not always available. In such cases, the guidance found in IEEE 1625 standard is beneficial because it forces important areas of the system to have specific protection levels and features.
From the tables, it easily can be seen that some design issues are more critical than others. The higher Risk Product Number (RPN) or Score value elements need to be scrutinized to ensure that a fault does not lead to a hazard. In the example analysis, the use of n-channel safety FETs rather than the traditional p-channel FETs is indicated (Table 3). This is a result of the DFMEA and its living nature — a high RPN value exists where p-channel FETs are used, but the RPN value is drastically reduced when n-channel FETs are substituted.
B) However, if the AFE is non-functional, the BMU and AFE protection is non-functional, and the safety FETs will be on due to absence of gate drive.
B) However, if the AFE is non-functional, the BMU and AFE protection is non-functional, and the safety FETs will be off due to absence of gate drive.
When a full-system DFMEA is completed, the prioritization of tasks and risk assessment can be made with greater confidence. Design changes also can be easily evaluated and rolled back into the DFMEA to ensure the change has not adversely affected other areas of the design.
References
Standard for Rechargeable Batteries for Portable Computing, IEEE1625.
For more information on this article, CIRCLE 341 on Reader Service Card