Topics covered in this article: |
Ⅰ. Motivation |
Ⅱ. Phases in which the experiment was carried out |
Ⅲ. Results obtained after Simulation |
Ⅳ. Conclusion |
Despite living in the modern 21st century, where manufacturing systems have begun to become more automated, the fabrication process of various electronic parts such as the Integrated Circuit (IC) may result in errors. While conducting research in any field, it is important to study patterns of any kind to collect, analyze and interpret results.
When it comes to wafer tests of any form, results are achieved using wafer maps, which provide the location of the failed dies on the wafer.
Wafer defect patterns have been proved to identify errors during several tests using advanced statistical analysis. Initially, older methods were used wherein the data obtained turned out to be inaccurate leading to more problems in conducting experiments. Therefore, various new detection methods have come up to get more precise and accurate results in diverse fields.
Ⅰ. Motivation
The quality of the training dataset and the Machine Learning (ML) technique was applied to have an impact on a prediction model's performance. Regardless of the ML algorithm, an inconsistent training set will negatively impact the prediction model's performance.
A collection of labeled wafer maps could be seen to have different levels of labeling mistakes. For the WM-811K database, for instance, nine defect kinds have been identified: Center, Donut, Edge-local, Edge ring, Local, Near-full, Random, Scratch, and None. With relatively acceptable recognition results, seven of the nine fault types with visually discernible patterns can be identified (higher than 80 percent). However, a more thorough examination of the database exposes a number of dubious designations, as illustrated in this section.
1. Inconsistent Labeling
Inconsistent labeling is illustrated in Fig. 1 with a few examples. Despite the fact that they appear to be very similar, Fig. 1(a) is labeled as the centre and Fig. 1(b) as a donut. Figures 1(c) and 1(d) both belong in the Scratch type category, however, Figure 1(c) is labeled as Donut. It is clear that Fig. 1(a) and Fig. 1(c) have incorrect labels, which makes supervised learning difficult.
Fig 1 Examples of inconsistent labeling in WM-811K (a) Center, (b) Donut, (c) Donut, (d) Scratch.
2. Multiple Patterns
Multiple fault patterns may be represented in a more intricate wafer map. These patterns may have only one type of fault or a variety of defect types. Examples of intricate wafer maps with mixed defect pattern types can be found in Fig. 2, while wafer maps with multiple defect patterns of the exact type can be discovered in Fig. 3.
Fig 2 Examples of mixed defect patterns in a single wafer map
Fig 3 Examples of multiple defect patterns of the same type
3. Special Rare Patterns
Fig 4 Examples of special patterns.
In certain cases, some special defect patterns may tend to occur which are not correctly recognized as a distinct pattern type. Most of the time, these wafer maps are classified under a vague but related fault category. For instance, Local, None, Scratch, and Local are the labels for the four wafer maps in Fig. 4. In reality, each of the four defect patterns in Fig.4 can be categorized as a distinct sort of defect pattern. Nearly half of the wafers in Fig. 4(a) are defective dies, hence this figure should be categorized as Half-full. Fig. 4(b) does not show any pattern, yet the defect density is between None (low density) and Random (high density). Parallel (or vertical) lines make up the flawed pattern in Fig 4(c), whereas in Figure 4(d).
4. Semi-Supervised Framework
In order to make sure supervised learning works, accurate labeling of the training data is very crucial. To solve problems of inaccurate data, semi-supervised learning is one of the ideal methods as prediction models are built using the initial manually labeled data. The remainder of the unlabeled data which is not used is then classified using the model. Additionally, they are also used to identify labeled samples that do not fit the model. This algorithm is then trained using the improved labeling in different iterations.
If these samples cannot be assigned to an existing defect type, they are processed by the unsupervised algorithm to produce new defect types. The following cycle of the supervised learning algorithm is then trained using the improved labeling. To get the final enhanced labeling and categorization, the entire process might be repeated multiple times.
Ⅱ. Phases in which the experiment was carried out
A total of three phases were used in which data was studied through the experiment. The initial two phases were extracted using machine learning algorithms after which labeled wafer maps were used to build a prediction model in the third phase.
Fig 5 Semi-supervised framework of wafer map pattern recognition with enhanced labeling.
1. Phase 1 - Preprocessing: All items in the WM-811K database may not be specified properly and around 1.84 % of the wafer maps will be removed. The next step is to discard random defects to obtain cleaner patterns containing two local defect patterns and some randomly distributed defect patterns.
2. Phase 2 - Feature Extraction: The machine learning algorithms will extract all the supervised and unsupervised features. For geometric features, cluster defects need to be located which can be done through connected-component analysis (CCA). The second will contain the statistic features where four classes will be employed.
3. Phase 3 - Classification Model Construction: In phase three, a low tolerance classification model must be formed as per the labeled data present. Here, all the non-categorized data will get screened.
4. Phase 4 - Classifying Unlabeled Data: In phase four, all the unlabelled data from phase three will be taken into consideration if the probability is greater than the threshold of the previous phase. If unclassified, it will classify as an “error wafer”.
5. Phase 5 - Unsupervised Learning: Here, similar wafer maps will get clustered which have not been labeled yet. This is done by a self-organized map (SOM) which is an artificial neural network (ANN) model.
6. Phase 6 - Enhanced Labeling: New SOM models need to be labeled manually as some groups are very similar to existing defect pattern types. Such existing defect pattern types will get clubbed into the already existing ones.
Ⅲ. Results obtained after Simulation
The training set of the labeled samples is where all the phases started in the first iteration. The total number of labeled samples increased as some unlabeled samples are grouped into a defect type after each iteration. After each fifth iteration, there was hardly any change in the quantity of labeled data.
Fig 6 Accuracy after each iteration
The accuracy achieved is shown in the above fig wherein the first iteration was provided for reference and the prediction model is constructed by using training data. On seeing the results, the first iteration showed very high accuracy (97%). But when closely observed, the actual accuracy was less than 80%. This was due to a large number of wafers with no detected patterns.
Ⅳ. Conclusion
To understand and study automatic wafer detection maps, the supervised model and its algorithm has proved to show positive results. The use of manual and machine learning models can both be used in different cases.
To improve and identify defect types, semi-supervised learning can be very helpful for a number of applications.