In quality and manufacturing, statistical tests provide objective evidence for decision-making. They help identify variations in processes and distinguish random fluctuations from actual problems. In engineering, statistics help identify patterns, outliers, and sources of failure in system performance, ensuring data-driven decision-making. By rigorously analyzing experimental results, engineers can validate product designs and manufacturing processes, detecting potential problems before implementation. This systematic approach reduces the risk of unexpected failures and enhances overall safety by ensuring reliability and compliance with international safety standards.
This post reviews the main statistical tests used in manufacturing and Total Quality Management (TQM).
Note: although they also concern engineering, research and science, the following two statistical analyses
- correlation analysis: measures the strength and direction of the relationship between two variables (e.g., Pearson correlation coefficient).
- regression analysis: examines the relationship between variables (e.g., input factors and process output), from simple linear to multiple regression.
are not covered here; they are treated in a separate article about the 10 main algorithms for engineering.
Normality Tests
In the world of statistical tests, many common methods (t-tests, ANOVA, linear regression, etc.) assume that the data are normally (Gaussian) distributed, or that the residuals/errors are normal. Violating this assumption can make the results unreliable: p-values can be misleading, confidence intervals may be wrong, and the risk of Type I/II errors increases. Note that some tests, such as the one-way ANOVA, handle moderately non-normal distributions reasonably well.
Note: if your data are not normal (see the real-life cases below), you may need to use non-parametric tests (such as the Mann-Whitney U test or Kruskal-Wallis test), which do not assume normality, or transform your data; both approaches are outside the scope of this post.
While several statistical tests exist for this, we detail here the Shapiro-Wilk test, known especially for small sample sizes (typically n < 50) but usable up to about n = 2000.
FYI, other common normality tests:
- Kolmogorov-Smirnov (K-S) test (with Lilliefors correction): works better with larger sample sizes, but is less sensitive than Shapiro-Wilk, especially for small datasets.
- Anderson-Darling test: works with all sample sizes and is more sensitive to the tails (extremes) of the distribution, making it more powerful for detecting departures from normality in the extremes.
How-to perform the Shapiro-Wilk normality test
1. Calculate or compute the Shapiro-Wilk test statistic (W):
W = \frac{\left(\sum_{i=1}^{n} a_i x_{(i)}\right)^2}{\sum_{i=1}^{n} (x_i - \bar{x})^2}
where:
- x_{(i)} = the i-th order statistic (i.e., the i-th smallest value in the sample)
- x_i = the i-th observed value
- \bar{x} = the sample mean
- a_i = constants (weights) calculated from the means, variances, and covariances of the order statistics of a sample from a standard normal distribution (N(0,1)); they depend only on the sample size n
- n = sample size
The numerator is the squared sum of the weighted ordered sample values; the denominator is the sum of the squared deviations from the sample mean (i.e., the sample variance scaled by n-1).
Note: the calculation of the a_i coefficients is nontrivial and generally requires a table or algorithm, which is why the Shapiro-Wilk test is nearly always computed by software such as R, Python's SciPy, MS Excel add-ons or other dedicated software. For a manual calculation, this page provides all the a_i coefficients and p-values for samples up to n = 50.
The value of W ranges between 0 and 1 (W = 1: perfect normality; the further W is from 1, the less normal your data are).
2. W alone is not enough. It works in conjunction with its corresponding p-value to give the confidence level. In the Shapiro-Wilk table, at the row for your sample size n, look for the closest value to your calculated W and read its corresponding p-value at the top.
3. Result: if the p-value is greater than the chosen alpha level (e.g., 0.05), there is no statistical evidence against normality, and the data can be treated as normally distributed.
For normality testing, it is frequently advised to combine a numerical method with a graphical method such as Henry's line, Q-Q plots or histograms:
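As an illustration, here is a minimal Python sketch combining the Shapiro-Wilk test with a Q-Q plot, using SciPy and Matplotlib; the diameter data are made up for the example.

```python
import numpy as np
from scipy import stats
import matplotlib.pyplot as plt

# Hypothetical sample: 30 diameter measurements from a process (illustrative values)
rng = np.random.default_rng(42)
diameters = rng.normal(loc=10.0, scale=0.05, size=30)

# Numerical check: Shapiro-Wilk test
w_stat, p_value = stats.shapiro(diameters)
print(f"W = {w_stat:.4f}, p-value = {p_value:.4f}")
if p_value > 0.05:
    print("No evidence against normality at alpha = 0.05")
else:
    print("Data likely not normally distributed")

# Graphical check: Q-Q plot against a normal distribution
stats.probplot(diameters, dist="norm", plot=plt)
plt.title("Q-Q plot of diameter measurements")
plt.show()
```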
Mind Non-normal Distributions!
While the normal/Gaussian distribution is the most frequent case, it should not be automatically assumed. Everyday counter-examples include:
- Wealth and income distribution among individuals. It follows a Pareto (power law) distribution, skewed with a “long tail” of very wealthy individuals.
- City population sizes in a country follow Zipf’s Law (power law), with a few very large cities and many small towns.
- Earthquake magnitudes and frequency are a power law/Gutenberg-Richter distribution: small earthquakes are common, large ones are rare.
- Daily price changes or returns in financial markets: fat-tailed/heavy-tailed distributions, not Gaussian; large deviations occur more frequently than predicted by a normal distribution.
- Word frequencies in language: like the city populations above, they follow Zipf's Law (power law): few words are used often, most words are rare.
- Internet traffic/website popularity: power law/long tail: Some sites have millions of hits, most have very few.
- File sizes on computer systems: log-normal or power law, with a few very large files and many small ones.
- Human lifespans/longevity: skewed, not normal (often modeled with Weibull or Gompertz distributions); deaths cluster at older ages, with a long tail toward younger ages.
- Social network connections follow a power law: few users have many connections; most have few.
Most of these are characterized by “few large, many small”, a signature of power laws, heavy tails, exponential or log-normal distributions, and not the symmetrical shape of the Gaussian.
The t-Test (Student’s t-Test)
The t-Test (aka "Student's t"), developed by William Sealy Gosset under the pseudonym "Student" in 1908, is a statistical test used to compare means when sample sizes are small and the population variance is unknown. Focused on comparing the means of two populations, it is one of the most used tests in manufacturing.
Purpose: the t-Test helps engineers and quality professionals determine if there is a statistically significant difference between the means of two groups or between a sample mean and a known standard. It’s commonly used in hypothesis testing to evaluate whether process changes or product modifications have led to real improvements or differences, beyond what could be expected by chance.
Practical examples in the industry:
- In automotive manufacturing, a t-Test might be used to compare the tensile strength of steel from two different suppliers to ensure consistent quality.
- In pharmaceuticals, the t-Test is used to analyze whether a new production process yields tablets with a mean weight significantly different from the standard.
- In electronics, engineers may use the t-Test to verify if a design change in a circuit board results in a measurable improvement in electrical resistance.
How-to the Student’s t-Test
There are many variants of the t-test; the example here focuses on the so-called "two-sample t-test" in its "unpaired" version, comparing samples from 2 different production batches.
- State your null and alternative hypotheses; in this example, "there is no difference between the means" vs "the means are different".
- Collect your data from the 2 production batches being compared and calculate
- the 2 sample means \bar{X} = \frac{1}{n_1} \sum_{i=1}^{n_1} X_i and \bar{Y} = \frac{1}{n_2} \sum_{j=1}^{n_2} Y_j
- Calculate the 2 sample variances: S_X^2 = \frac{1}{n_1-1} \sum_{i=1}^{n_1} (X_i - \bar{X})^2 and S_Y^2 = \frac{1}{n_2-1} \sum_{j=1}^{n_2} (Y_j - \bar{Y})^2
- the 2 sample sizes n_1 and n_2.
- Calculate the test statistic. While the method assumes that both samples are independent and drawn from normally distributed populations, there are still two cases:
- if equal variances are assumed ("pooled" t-test):
Pooled variance: S_p^2 = \frac{ (n_1-1)S_X^2 + (n_2-1)S_Y^2 }{ n_1 + n_2 - 2 }
Test statistic: t = \frac{ \bar{X} - \bar{Y} }{ S_p \sqrt{ \frac{1}{n_1} + \frac{1}{n_2} } }
- if unequal variances (Welch's t-test):
Test statistic: t = \frac{ \bar{X} - \bar{Y} }{ \sqrt{ \frac{S_X^2}{n_1} + \frac{S_Y^2}{n_2} } }
Degrees of freedom (approximate, Welch-Satterthwaite): df = \frac{\left( \frac{S_X^2}{n_1} + \frac{S_Y^2}{n_2} \right)^2}{ \frac{ (S_X^2 / n_1)^2 }{ n_1 - 1 } + \frac{ (S_Y^2 / n_2)^2 }{ n_2 - 1 } }
- Use the calculated t and the degrees of freedom (n_1+n_2-2 for equal variances, or the Welch formula) to look up or compute the p-value from the t-distribution (depending on whether it's a one-tailed or two-tailed test).
- Result: compare the calculated t-value with the critical t-value from statistical tables based on your chosen confidence level and degrees of freedom; alternatively, use software for the p-value. If the t-statistic exceeds the critical value or the p-value is below your threshold (typically 0.05), reject the null hypothesis.
Link to the t-Test critical values table
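For software users, here is a minimal Python sketch of the unpaired two-sample t-test with SciPy, in both the pooled and Welch variants; the tensile-strength values for the two batches are hypothetical.

```python
from scipy import stats

# Hypothetical tensile-strength measurements (MPa) from 2 production batches
batch_a = [512, 508, 515, 510, 507, 511, 509, 514]
batch_b = [505, 503, 509, 506, 504, 508, 502, 507]

# Pooled t-test (assumes equal variances)
t_pooled, p_pooled = stats.ttest_ind(batch_a, batch_b, equal_var=True)

# Welch's t-test (does not assume equal variances)
t_welch, p_welch = stats.ttest_ind(batch_a, batch_b, equal_var=False)

print(f"Pooled: t = {t_pooled:.3f}, p = {p_pooled:.4f}")
print(f"Welch : t = {t_welch:.3f}, p = {p_welch:.4f}")

# Decision at alpha = 0.05 (two-tailed)
alpha = 0.05
if p_welch < alpha:
    print("Reject H0: the batch means differ significantly")
else:
    print("Fail to reject H0: no significant difference between the means")
```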
The F-Test
The F-test, introduced by statistician Ronald A. Fisher in the early 20th century, is used to compare the variability (variance) between two sets of data, to assess whether their population variances are significantly different. In quality and engineering, it often helps determine whether process changes or different machines produce consistent results, or whether new methods affect product variability. It is often a preliminary step before applying t-tests and ANOVA to larger comparisons.
Purpose: the F-Test is used to confirm if two processes or samples have the same level of variation, which supports quality control decisions and process improvements. It helps engineers identify if changes (e.g., new machines, suppliers, or materials) impact the consistency or quality of a product.
Industry Examples
- Manufacturing: comparing the dimensional variances of parts produced by two different machines to ensure both machines produce consistently within quality standards.
- Supplier Evaluation: comparing the strength variability of raw materials from two different suppliers to decide if one supplier provides more consistent quality.
- Quality Improvement: testing if a process improvement (like a new calibration method) has reduced the variability in final product weight compared to the old method.
How-to the F-Test
- Collect two sets of sample data (e.g., measurements from process A and process B).
- Calculate the variance for each sample group A and B.
- Divide the larger variance by the smaller variance to get the F-value.
- Result: compare this F-value to a critical value from the F-distribution table based on the sample sizes and the desired confidence level; if the calculated F-value is greater, the variances are significantly different. In variance-ratio tests such as this one, the degrees of freedom (DOF) associated with each group are the number of samples minus one (note that the DOF are computed differently for ANOVA).
F-distribution table: link to the F-distribution table up to 15×15 DOF (and online F critical calculator for bigger DOF)
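A minimal Python sketch of this variance-ratio F-test: SciPy does not provide a dedicated two-sample variance F-test function, so the ratio is computed directly and the p-value taken from SciPy's F-distribution. The machine measurements are made-up example data.

```python
import numpy as np
from scipy import stats

# Hypothetical dimensional measurements (mm) from two machines
machine_a = np.array([25.01, 25.03, 24.98, 25.02, 25.00, 25.04, 24.99, 25.02])
machine_b = np.array([25.05, 24.95, 25.08, 24.92, 25.06, 24.94, 25.07, 24.96])

var_a = machine_a.var(ddof=1)  # sample variance of machine A
var_b = machine_b.var(ddof=1)  # sample variance of machine B

# Put the larger variance in the numerator, as in the how-to above
if var_a >= var_b:
    f_value = var_a / var_b
    df_num, df_den = len(machine_a) - 1, len(machine_b) - 1
else:
    f_value = var_b / var_a
    df_num, df_den = len(machine_b) - 1, len(machine_a) - 1

# Two-tailed p-value from the upper tail of the F-distribution
p_value = min(2 * stats.f.sf(f_value, df_num, df_den), 1.0)

print(f"F = {f_value:.3f}, DOF = ({df_num}, {df_den}), p = {p_value:.4f}")
if p_value < 0.05:
    print("Variances differ significantly at alpha = 0.05")
else:
    print("No significant difference between the variances")
```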
Analysis of Variance (ANOVA)
While the F-test above refers broadly to any statistical test that uses the F-distribution and is used to compare variances or ratios of variances between two or more groups, the ANOVA is a variant that compares the means of three or more groups to see if at least one is significantly different. The ANOVA test was also developed by Ronald Fisher in the 1920s as a statistical tool for agricultural experiments.
Purpose: the Analysis of Variance (ANOVA) determines whether there are statistically significant differences between the means of three or more independent groups. In quality, engineering and particularly in Design of Experiments (DOE), it helps identify which factors or processes have a significant impact on product performance or output, aiding robust decision-making and process improvement.
Examples:
- In pharmaceutical production, ANOVA can help compare the effects of different formulation processes on the efficacy of a drug.
- In electronics, it is used to test if the variance in circuit board failure rates is due to different batches of raw materials.
How-to ANOVA in Brief
1. Define the groups or treatments you want to compare and collect data from each group. Calculate the Sum of Squares Between groups (SSB) and the Sum of Squares Within groups (SSW).
2. Use these values to calculate the F-statistic (see below), which is the ratio of the variance between groups to the variance within groups.
3. Compare the F-statistic to a critical value from the F-distribution table at a chosen significance level (like 0.05).
4. Result: if the F-statistic exceeds the critical value, you conclude that there are significant differences among the group means.
The F-statistic: F corresponds to the Mean Square Between Groups (MSB) divided by the Mean Square Within Groups (MSW). Practically:
F = \frac{ \frac{SSB}{k-1} }{ \frac{SSW}{N-k} }
where SSB = Sum of Squares Between Groups, SSW = Sum of Squares Within Groups, k = number of groups and N = total number of observations.
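A minimal Python sketch of a one-way ANOVA with SciPy's f_oneway, which computes the F-statistic above from the group data; the efficacy values for the three formulation processes are hypothetical.

```python
from scipy import stats

# Hypothetical efficacy measurements (%) from three formulation processes
process_1 = [88.1, 89.4, 87.9, 88.6, 89.0]
process_2 = [90.2, 91.1, 89.8, 90.5, 90.9]
process_3 = [88.5, 88.9, 89.2, 88.3, 88.8]

# One-way ANOVA: F = MSB / MSW, computed internally from SSB and SSW
f_stat, p_value = stats.f_oneway(process_1, process_2, process_3)

print(f"F = {f_stat:.3f}, p = {p_value:.4f}")
if p_value < 0.05:
    print("At least one process mean differs significantly (alpha = 0.05)")
else:
    print("No significant difference among the process means")
```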
The Chi-Square Test
The Chi-Square Test, introduced by Karl Pearson in 1900, revolutionized statistical hypothesis testing by providing a method to determine if there is a significant difference between expected and observed frequencies in categorical data. In quality and engineering, it helps assess whether deviations in a process or product attributes occur by chance or suggest a systemic issue.
Purpose: the Chi-Square Test checks if the differences between observed and expected results in quality measurements are due to random variation or indicate a specific problem that needs addressing.
Practical examples in industry
- Manufacturing Defects: checking if the distribution of defective products across different shifts or machines is uniform, and whether certain shifts have a significantly higher defect rate.
- Supplier Quality: comparing the quality performance (e.g., pass/fail rates) of components from multiple suppliers to determine if one supplier’s parts are statistically more likely to fail.
- Customer Complaints: analyzing whether the types or frequency of customer complaints are randomly distributed throughout the year, or are associated with specific times, products, or regions.
How to do the chi-square test
- Collect observed data and determine the expected frequencies for each category under the null hypothesis.
- Use the Chi-Square formula: \chi^2 = \sum \frac{(O_i - E_i)^2}{E_i}, where O_i is the observed frequency and E_i the expected frequency for category i.
- Compare the calculated Chi-Square value against a critical value from the Chi-Square table with the appropriate degrees of freedom.
- Result: if the value exceeds the table value, conclude there is a statistically significant difference.
Link to the chi-square critical values table
Chi-Square Full Example: Fairness of a Die
| i | Oi | Ei | Oi − Ei | (Oi − Ei)² |
|---|----|----|---------|------------|
| 1 | 5  | 10 | −5      | 25         |
| 2 | 8  | 10 | −2      | 4          |
| 3 | 9  | 10 | −1      | 1          |
| 4 | 8  | 10 | −2      | 4          |
| 5 | 10 | 10 | 0       | 0          |
| 6 | 20 | 10 | 10      | 100        |
| Sum |  |   |         | 134        |
This full example is taken from the Wikipedia article on Pearson's chi-squared test.
Experiment: a 6-sided die is thrown 60 times. The number of times it lands face up on 1, 2, 3, 4, 5, 6 is 5, 8, 9, 8, 10 and 20, respectively.
Question: is the die biased, according to Pearson's chi-squared test, at the 95% and/or 99% confidence level?
The null hypothesis is that the die is unbiased, hence each number is expected to occur the same number of times; here, 60/6 = 10.
The outcomes can be tabulated as in the table above.
Upper-tail critical values of the chi-square distribution (probability less than the critical value):

| Degrees of freedom | 0.90 | 0.95 | 0.975 | 0.99 | 0.999 |
|---|---|---|---|---|---|
| 5 | 9.236 | 11.070 | 12.833 | 15.086 | 20.515 |
Looking at an upper-tail critical values table of the chi-square distribution (linked in the how-to above), the value to compare with the table is the sum of the squared deviations, each divided by the expected count.
For the present example, this gives: \chi^2 = \frac{25}{10} + \frac{4}{10} + \frac{1}{10} + \frac{4}{10} + \frac{0}{10} + \frac{100}{10} = 13.4
Conclusion of the test: 13.4 is the experimental result whose unlikeliness (with a fair die) we wish to estimate. With 5 degrees of freedom, 13.4 lies between the critical values 12.833 (97.5%) and 15.086 (99%): the die can be declared biased at the 95% (and even 97.5%) confidence level, but not at the 99% level.
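The same die example as a minimal Python sketch using SciPy's chisquare, with the observed counts from the table above:

```python
from scipy import stats

# Observed counts for faces 1..6 after 60 throws (from the example above)
observed = [5, 8, 9, 8, 10, 20]
expected = [10, 10, 10, 10, 10, 10]  # fair die: 60 / 6 = 10 per face

chi2, p_value = stats.chisquare(f_obs=observed, f_exp=expected)

print(f"chi2 = {chi2:.1f}, p = {p_value:.4f}")  # chi2 = 13.4
if p_value < 0.05:
    print("Reject H0 at alpha = 0.05: the die appears biased")
else:
    print("Fail to reject H0: no evidence of bias")
```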
Process Capability (Cp, Cpk, Pp, Ppk)
Not statistical tests per se, these 4 ratios assess how well a process meets specifications, making them a critical tool for maintaining and improving quality standards in manufacturing.
Process capability analysis originated in the early 20th century alongside the rise of statistical quality control in manufacturing, pioneered by figures like Walter Shewhart. Its methods evolved through the growth of Six Sigma and Total Quality Management (TQM) in the late 20th century as a cornerstone of modern quality engineering.
Purpose: process capability analysis assesses how well a process can produce output within specified limits (tolerances). It quantifies the variability of a process relative to design specifications and determines the likelihood of producing defective products. The analysis helps identify opportunities for process improvement and ensures products consistently meet customer requirements.
Cp, Cpk and Statistical Tests in Industry
- Automotive manufacturing: statistical tests and these 4 ratios are used to check whether the diameter of engine pistons remains consistently within tight tolerance limits, ensuring compatibility and reducing engine failures.
- Pharmaceutical industry: applied to verify that the fill weight of tablets or capsules consistently meets regulatory and quality standards, minimizing underdose or overdose risks.
- Semiconductor manufacturing: employed to monitor the thickness of wafer coatings, ensuring reliability and performance in microchip production.
How to calculate Cp, Cpk, Pp and Ppk
Cp: Process Capability
Cp = \frac{USL - LSL}{6\sigma}
where USL = Upper Specification Limit, LSL = Lower Specification Limit, and \sigma = standard deviation (typically estimated from within-subgroup variation).
Cpk: Process Capability Index
Cpk = \min\left(\frac{USL - \mu}{3\sigma}, \frac{\mu - LSL}{3\sigma}\right)
where \mu = process mean.
Pp: Process Performance
Pp = \frac{USL - LSL}{6s}
where s = overall standard deviation (includes both within- and between-subgroup variation; used over a longer period).
Ppk: Process Performance Index
Ppk = \min\left(\frac{USL - \bar{x}}{3s}, \frac{\bar{x} - LSL}{3s}\right)
where \bar{x} = overall mean.
How to Conclude with Cp, Cpk, Pp, Ppk Values
- Cp, Pp: if > 1, the process has the potential to meet specifications; values ≥ 1.33 are generally considered capable, depending on your industry and the criticality of your exact application.
- Cpk, Ppk: these reflect how centered the process is within specs; the closer Cpk/Ppk are to Cp/Pp, the more centered the process.
- If Cpk or Ppk <1, a significant portion of output is likely outside the specification; process improvement is needed.
- A higher index indicates a more capable (and usually, higher quality) process.
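A minimal Python sketch computing the four indices from the formulas above; the piston-diameter data and specification limits are made up, and the within-subgroup sigma is estimated here as the pooled subgroup standard deviation (one common convention among several, such as R-bar/d2).

```python
import numpy as np

# Hypothetical piston diameter data (mm): 5 subgroups of 5 consecutive parts
subgroups = np.array([
    [24.98, 25.01, 25.00, 25.02, 24.99],
    [25.03, 25.00, 25.01, 24.98, 25.02],
    [24.99, 25.00, 25.02, 25.01, 25.00],
    [25.01, 24.97, 25.00, 25.02, 25.01],
    [25.00, 25.03, 24.99, 25.01, 25.00],
])
USL, LSL = 25.10, 24.90  # assumed specification limits

data = subgroups.ravel()
mean = data.mean()

# Within-subgroup standard deviation (pooled), used for Cp / Cpk
sigma_within = np.sqrt(np.mean(subgroups.var(axis=1, ddof=1)))

# Overall standard deviation, used for Pp / Ppk
s_overall = data.std(ddof=1)

Cp  = (USL - LSL) / (6 * sigma_within)
Cpk = min((USL - mean) / (3 * sigma_within), (mean - LSL) / (3 * sigma_within))
Pp  = (USL - LSL) / (6 * s_overall)
Ppk = min((USL - mean) / (3 * s_overall), (mean - LSL) / (3 * s_overall))

print(f"Cp = {Cp:.2f}, Cpk = {Cpk:.2f}, Pp = {Pp:.2f}, Ppk = {Ppk:.2f}")
```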
Conclusion & Pitfalls
Statistical tests are powerful tools in data analysis, but their use demands both strong theoretical understanding and critical real-world judgment and adaptation, far more than just installing statistical software or following QMS rules.
- Understanding assumptions & selecting the right test: every statistical test has a set of underlying assumptions (e.g., normality of data, equal variances, independence of observations). If these assumptions are violated or an inappropriate test is chosen, the results of the test may be invalid or misleading.
- Real-world messiness & business context matters: industrial data often violate test assumptions (e.g., non-normality, autocorrelation). Blindly applying textbook tests can result in completely misleading analyses.
- Data quality issues: measurement errors, outliers, and missing data are common in industrial statistical tests and must be addressed and documented before testing.
For product design as well as for quality, put your effort where it is needed: "Sometimes, results are statistically significant but have negligible practical impact, or vice versa."