Volume 2013, Issue 1 359634

Research Article

Open Access

Identifying the Association Rules between Clinicopathologic Factors and Higher Survival Performance in Operation-Centric Oral Cancer Patients Using the Apriori Algorithm

Jen-Yang Tang

orcid.org/0000-0002-5690-0708

Department of Radiation Oncology, Faculty of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan kmu.edu.tw

Department of Radiation Oncology, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan kmu.edu.tw

Cancer Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan kmu.edu.tw

Li-Yeh Chuang

Department of Chemical Engineering and Institute of Biotechnology and Chemical Engineering, I-Shou University, Kaohsiung, Taiwan isu.edu.tw

Edward Hsi

orcid.org/0000-0002-8272-6707

Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan kmu.edu.tw

Yu-Da Lin

orcid.org/0000-0001-5100-6072

Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan kuas.edu.tw

Corresponding Author

Cheng-Hong Yang

Department of Electronic Engineering, National Kaohsiung University of Applied Sciences, Kaohsiung, Taiwan kuas.edu.tw

Corresponding Author

Hsueh-Wei Chang

orcid.org/0000-0002-2060-224X

Cancer Center, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan kmu.edu.tw

Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, Taiwan kmu.edu.tw

First published: 25 July 2013

https://doi.org/10.1155/2013/359634

Citations: 18

Academic Editor: Tsair-Fwu Lee

Abstract

This study computationally determines the contribution of clinicopathologic factors correlated with 5-year survival in oral squamous cell carcinoma (OSCC) patients primarily treated by surgical operation (OP) followed by other treatments. From 2004 to 2010, the program enrolled 493 OSCC patients at the Kaohsiung Medical University Hospital. The clinicopathologic records were retrospectively reviewed and compared for survival analysis. The Apriori algorithm was applied to mine the association rules between these factors and improved survival. Univariate analysis of demographic data showed that grade/differentiation, clinical tumor size, pathology tumor size, and OP grouping were associated with survival longer than 36 months. Using the Apriori algorithm, multivariate correlation analysis identified the factors that jointly co-occur with good survival at higher lift values, such as grade/differentiation = 2, clinical stage group = early, primary site = tongue, and group = OP. Without the OP, the lift values are lower. In conclusion, this hospital-based analysis suggests that early OP and other treatments starting from OP are the key to improving the survival of OSCC patients, especially for early stage tongue cancer with moderate differentiation, which shows better survival (>36 months) with varied OP approaches.

1. Introduction

In Taiwan, betel nut chewing, cigarette smoking, and alcohol consumption have been found to be highly associated with oral cancer [1], with habitual betel nut chewers showing a particularly high prevalence [2–4]. Oral cancer is one of the 10 most prevalent cancers in Taiwan and is mostly classified as oral squamous cell carcinoma (OSCC) [5], which has high rates of morbidity and mortality [6] because diagnosis often takes place only in the later stages [7]. Although many tumor markers [8–10] and single nucleotide polymorphism (SNP) markers [11] have been reported as being associated with oral cancer, outcome-based studies focusing on oral cancer therapy are lacking.

The survival of OSCC patients following surgical therapy has been reported to be affected by tumor size, nodal metastasis, staging, and differentiation [12]. Some researchers have further examined the factors involved in the outcomes of postoperative radiotherapy for OSCC patients [13]. However, how these multiple factors jointly correlate with, and could predict, good survival after OSCC therapy has rarely been addressed and remains a challenge.

Recently, several computational methodologies have been introduced to analyze the relationship between multiple factors and therapies for several non-OSCC diseases, including machine learning algorithms [14], data mining [15], decision tree-based learning [16], and rule-based multiscale simulations [17].

The Apriori algorithm is used here to explore the correlation between clinical factors and good survival outcomes (i.e., >36 months) in operation- (surgery-) centric treatments, including operation alone, operation followed by IA, and operation followed by combinations of IA, CT, IV, and RT, where IA, IV, CT, and RT stand for intra-arterial chemotherapy, intravenous chemotherapy, oral chemotherapy, and radiotherapy, respectively. The study aims to computationally evaluate the correlation between clinicopathological factors and survival outcomes in 493 OSCC patients treated by operation alone or by operation followed by other nonsurgical treatments.

2. Materials and Methods

2.1. Data Source

The database used to construct our case and control groups was obtained from the chart registry of the Cancer Center of Kaohsiung Medical University Hospital from 2004 to 2010. Patients were excluded if they had distant metastases at presentation, did not complete the therapeutic protocol at Kaohsiung Medical University Hospital, or had incomplete records. A total of 493 patients fulfilled the requirements and were included for further analyses (the raw data set is available at http://bioinfo.kmu.edu.tw/OP_high-OP_low_groups.xlsx). The patients were followed at Kaohsiung Medical University Hospital. The last follow-up was recorded as the last outpatient visit or the date of death. The use of patient data and the study design were reviewed and approved by the Institutional Review Board of Kaohsiung Medical University Hospital (KMUH-IRB-EXEMPT-20130029).

2.2. Introduction of the Apriori Algorithm

The problem of association rule learning can be stated as follows. Let I = {i₁, i₂, …, i_m} be a set of literals, called items. Let a transaction T be a set of items, where T ⊆ I, and let D be a set of transactions. An association rule is an implication of the form A⇒B, where A ⊂ I, B ⊂ I, and A∩B = Ø. The rule A⇒B holds in the transaction set D with confidence c if c% of the transactions in D that contain A also contain B. The rule A⇒B has support s in the transaction set D if s% of the transactions in D contain A ∪ B. Itemsets whose support reaches the minimum support are called large itemsets, and the others are called small itemsets.
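The definitions of support and confidence can be illustrated with a minimal Python sketch. The four toy transactions and the function names `support` and `confidence` are hypothetical and are not taken from the study; values are returned as fractions rather than percentages.

```python
from typing import FrozenSet, List

# Toy transaction set D (hypothetical data, not from the study).
D: List[FrozenSet[str]] = [
    frozenset({"a", "b", "c"}),
    frozenset({"a", "b"}),
    frozenset({"a", "c"}),
    frozenset({"b", "c"}),
]

def support(itemset: FrozenSet[str], transactions: List[FrozenSet[str]]) -> float:
    """Fraction of transactions that contain every item of `itemset`."""
    hits = sum(1 for t in transactions if itemset <= t)
    return hits / len(transactions)

def confidence(a: FrozenSet[str], b: FrozenSet[str],
               transactions: List[FrozenSet[str]]) -> float:
    """Fraction of the transactions containing A that also contain B."""
    return support(a | b, transactions) / support(a, transactions)

A, B = frozenset({"a"}), frozenset({"b"})
print(support(A | B, D))    # support of the rule A => B
print(confidence(A, B, D))  # confidence of the rule A => B
```

In this toy set, two of the four transactions contain both a and b, and three contain a, so the rule a⇒b has a support of 0.5 and a confidence of 2/3.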

The Apriori algorithm was proposed by Agrawal and Srikant in 1994 [18] and has been widely used for frequent itemset mining and association rule learning in databases. The algorithm first finds the large itemsets and then generates the desired rules from them. The general idea is that if ABCD is a large itemset, then any rule formed from the items of ABCD, for example AB⇒CD, has the minimum required support because ABCD is large.
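As an illustration of this idea, the sketch below enumerates every rule that can be formed by splitting one large itemset into a non-empty antecedent and a non-empty consequent. The function name `rules_from_itemset` is hypothetical. All of these rules share the support of the itemset itself, so in practice they would still have to be filtered by confidence.

```python
from itertools import combinations

def rules_from_itemset(itemset):
    """Yield every (antecedent, consequent) split of `itemset` into two non-empty parts."""
    items = sorted(itemset)
    for r in range(1, len(items)):
        for antecedent in combinations(items, r):
            consequent = tuple(i for i in items if i not in antecedent)
            yield antecedent, consequent

rules = list(rules_from_itemset({"A", "B", "C", "D"}))
print(len(rules))                              # 2**4 - 2 = 14 candidate rules
print((("A", "B"), ("C", "D")) in rules)       # the rule AB => CD is among them
```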

The Apriori algorithm can be divided into three steps. Algorithm 1 shows the pseudocode of the Apriori algorithm. The first pass counts item occurrences to screen the large 1-itemsets (Section 2.2.1). Each subsequent pass k generates the candidate itemsets C_k from the large itemsets L_k−1 using the apriori-gen function (Section 2.2.2). Next, for each transaction t, the subset function (Section 2.2.3) determines which candidate k-itemsets in C_k are contained in t, giving the set C_t. Finally, the count of each candidate c in C_t is incremented, and c is stored in L_k if c.count ≥ minimum support. The algorithm terminates when L_k is empty; that is, no frequent itemset of k or more items is present in D.

Algorithm 1: Pseudocode of the Apriori algorithm.

L₁ = {l₁, …, l_n ∣ ∀ l ∈ large itemsets}  // see Section 2.2.1
set k = 2
while (L_k−1 ≠ Ø)
    C_k = apriori-gen (L_k−1) = {c₁, …, c_p ∣ c ∈ candidate k-itemsets}  // see Section 2.2.2
    if (C_k = Ø)
        return
    end if
    for (all t ∈ D)
        C_t = subset (C_k, t)  // see Section 2.2.3
        for (all c ∈ C_t)
            c.count++
        end for
    end for
    L_k = {c ∈ C_k ∣ c.count ≥ minsup}
    k++
end while
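The level-wise loop of Algorithm 1 can be sketched in Python as follows. This is a simplified illustration under stated assumptions, not the implementation used in the study: `minsup` is taken as an absolute count, candidates are generated by uniting pairs of large (k − 1)-itemsets rather than by the prefix join of Section 2.2.2, and candidates are counted by a direct containment test rather than a hash tree. The toy transactions are hypothetical.

```python
from collections import Counter
from itertools import combinations

def apriori(transactions, minsup):
    """Return {itemset: count} for all itemsets whose count is >= minsup (absolute count)."""
    transactions = [frozenset(t) for t in transactions]
    # First pass: count single items and keep the large 1-itemsets.
    counts = Counter(frozenset([i]) for t in transactions for i in t)
    large = {s: n for s, n in counts.items() if n >= minsup}
    result = dict(large)
    k = 2
    while large:
        # Simplified candidate generation: unions of two large (k-1)-itemsets of size k ...
        candidates = {a | b for a, b in combinations(list(large), 2) if len(a | b) == k}
        # ... kept only if every (k-1)-subset is itself large.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in large for s in combinations(c, k - 1))
        }
        if not candidates:
            break
        # Count the candidates contained in each transaction.
        counts = Counter(c for t in transactions for c in candidates if c <= t)
        large = {c: n for c, n in counts.items() if n >= minsup}
        result.update(large)
        k += 1
    return result

toy = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "c"}, {"a", "b", "c"}]
freq = apriori(toy, minsup=3)
for itemset, n in sorted(freq.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print(sorted(itemset), n)
```

With these toy data, all three single items and all three pairs reach the minimum count of 3, whereas the triple {a, b, c} occurs only twice and is therefore not large.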

2.2.1. Screening the Large 1-Itemsets

Algorithm 2 shows the pseudocode of the first pass, which simply counts the occurrences of each item in I = {i₁, i₂, …, i_m} to determine the large 1-itemsets. The array Item-counts holds the occurrence count of each item, and the items whose count reaches the minimum support are included in the set L₁.

Algorithm 2: The first pass of the Apriori algorithm.

for (all i ∣ 1 ≤ i ≤ m)
    set Item-counts [i] = 0
end for
for (all t ∈ D)
    for (all i ∈ t)
        Item-counts [i]++
    end for
end for
L₁ = {l_i ∣ Item-counts[i] ≥ minsup}
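A minimal Python sketch of this first pass is given below. The encoding of each patient record as a list of attribute=value strings, the three example records, and the function name `large_1_itemsets` are assumptions made for illustration only; `minsup` is again an absolute count.

```python
from collections import Counter

def large_1_itemsets(transactions, minsup):
    """Count each item once per transaction and keep the items with count >= minsup."""
    item_counts = Counter()
    for t in transactions:
        item_counts.update(set(t))  # set() guards against an item repeated within a record
    return {item: n for item, n in item_counts.items() if n >= minsup}

# Hypothetical patient records encoded as attribute=value items.
records = [
    ["site=tongue", "stage=early", "group=OP"],
    ["site=tongue", "stage=late", "group=OP"],
    ["site=lip", "stage=early", "group=OP"],
]
L1 = large_1_itemsets(records, minsup=2)
print(L1)
```

Here the items site=tongue, stage=early, and group=OP are retained, whereas stage=late and site=lip occur only once and are discarded.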

2.2.2. Candidate Set Generations

The function apriori-gen (L_k−1) generates C_k from L_k−1 and returns a superset of the set of all large k-itemsets. Algorithm 3 shows the pseudocode of the function apriori-gen (L_k−1). Each frequent (k − 1)-itemset in L_k−1 is stored as an ordered list of items, L_k−1.item[i] for i ∈ {1, …, k − 1}. Candidates are built from pairs of itemsets L_k−1.item_p, L_k−1.item_q ∈ L_k−1. For each L_k−1.item_p in L_k−1, we search the itemsets in L_k−1 and stop the search when we find an L_k−1.item_q whose first k − 2 items are not equal to the first k − 2 items of L_k−1.item_p. Only if L_k−1.item_q satisfies L_k−1.item_p[i] = L_k−1.item_q[i] for all i ∈ {1, …, k − 2} is the candidate k-itemset c = {L_k−1.item_p[1], …, L_k−1.item_p[k − 2], L_k−1.item_p[k − 1], L_k−1.item_q[k − 1]} created. Finally, c is added to C_k only if all (k − 1)-item subsets of c are included in L_k−1.

Algorithm 3: Pseudocode of the function apriori-gen().

Function apriori-gen (L_k−1)
    set C_k ← Ø
    for (all L_k−1.item_p, L_k−1.item_q ∣ L_k−1.item_p[i] = L_k−1.item_q[i], ∀i ∈ {1, …, k − 2})
        c = {L_k−1.item_p[1], …, L_k−1.item_p[k−2], L_k−1.item_p[k−1], L_k−1.item_q[k−1]}
        if (every (k − 1)-item subset of c ∈ L_k−1)
            C_k ← C_k ∪ {c}
        end if
    end for
    return C_k
end Function
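The join and prune steps of apriori-gen can be sketched in Python as follows. The representation of itemsets as sorted tuples and the requirement that the last item of item_p be smaller than the last item of item_q (to avoid generating each candidate twice) are assumptions of this sketch; the toy input is illustrative.

```python
from itertools import combinations

def apriori_gen(large_prev):
    """Generate candidate k-itemsets from large (k-1)-itemsets given as sorted tuples."""
    large_prev = sorted(large_prev)
    prev_set = set(large_prev)
    k = len(large_prev[0]) + 1 if large_prev else 0
    candidates = []
    for p, q in combinations(large_prev, 2):
        # Join step: p and q agree on their first k-2 items and differ in the last one.
        if p[:-1] == q[:-1] and p[-1] < q[-1]:
            c = p + (q[-1],)
            # Prune step: every (k-1)-item subset of c must itself be large.
            if all(s in prev_set for s in combinations(c, k - 1)):
                candidates.append(c)
    return candidates

L3 = [(1, 2, 3), (1, 2, 4), (1, 3, 4), (1, 3, 5), (2, 3, 4)]
C4 = apriori_gen(L3)
print(C4)
```

For this input the join step produces (1, 2, 3, 4) and (1, 3, 4, 5); the second is pruned because its subset (1, 4, 5) is not in L3, so only (1, 2, 3, 4) remains as a candidate.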

2.2.3. Candidate Set Counts Using Hash Tree

After the candidate set C_k is generated, the candidates are stored in a hash tree used by the function subset (C_k, t). Each leaf of the hash tree holds pointers to candidates in C_k and their associated counters, and the leaves correspond to distinct partitions of C_k. In the hash tree, the hash function is used both to insert the candidate itemsets and to search for the subsets of a transaction in C_k. The hash function is hash(i) = i mod T, T < m, where T is a constant (not the transaction T of Section 2.2) and m is the number of items. The function subset (C_k, t) is a recursive function that traverses the tree from the root node to the leaves, with each item in t = {i₁, …, i_d} chosen as a possible starting item of a candidate itemset. The hash function is applied at every level of the tree. When the traversal reaches a leaf of the tree, all candidate itemsets in that leaf are checked against t and their counters are updated.
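The role of the subset function can be illustrated with the simplified Python sketch below. It deliberately swaps the hash tree for a built-in hashed set of candidate tuples and enumerates the k-item subsets of each transaction, which gives the same counts on small inputs but is not the data structure described above. The candidate set and transactions are hypothetical.

```python
from itertools import combinations

def subset(candidates, t):
    """Return the candidate k-itemsets (sorted tuples) that are contained in transaction t."""
    if not candidates:
        return []
    k = len(next(iter(candidates)))
    # Enumerate the k-item subsets of t and look each one up in the hashed candidate set.
    return [s for s in combinations(sorted(t), k) if s in candidates]

C2 = {(1, 2), (1, 3), (2, 3), (2, 5)}
counts = dict.fromkeys(C2, 0)
for t in [{1, 2, 3}, {2, 5}, {1, 3, 4}]:
    for c in subset(C2, t):
        counts[c] += 1  # corresponds to c.count++ in Algorithm 1
print(counts)
```

A hash tree becomes preferable when transactions are long, because enumerating all k-item subsets of a transaction grows combinatorially, whereas the tree restricts the search to branches that can still lead to a candidate.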

2.3. Statistics Analysis

Statistical analysis was performed with JMP version 9. All statistical tests were done at a 0.05 significance level.

3. Results and Discussion

3.1. Demographic Data and Survival

3.1.1. Age and Survival

As shown in Table 1, all patients were categorized into two groups based on whether survival was longer or shorter than 36 months. No difference among the age groups was found, probably because anyone who was eligible for surgical resection would have comparable survival rates.

Table 1. Demographic data of 493 enrolled patients with OSCC.

Characteristics	Total	Survived >36 months	Survived <36 months	P value^*1	5-year survival (%)	P value^*2
Age				0.7786		0.5556
<30	7	3	4		71.4
30~50	228	125	103		77.2
50~70	236	129	107		79.2
>70	22	14	8		63.6
Primary Site				0.7915		0.1957
Lip	36	24	12		86.1
Cheek mucosa	184	103	81		83.2
Gum	42	25	17		71.4
Tongue	175	88	87		72.0
Mouth floor	19	11	8		68.4
Palate	5	3	2		60.0
Retromolar	27	15	12		77.8
Vestibule	2	1	1		100.0
Nonspecific	3	1	2		100.0
Laterality^*3				0.3965		0.8612
00	37	22	15		73.0
01	230	123	107		79.1
02	223	123	100		76.7
03	3	3	0		66.7
04	0	0	0		NA
Grade/differentiation				0.1476		0.0006
01	287	156	131		80.1
02	123	60	63		65.0
03	7	5	2		57.1
04	1	1	0		100.0
09	75	49	26		89.3
Regional lymph nodes examined				0.1550		0.1424
<5	285	160	125		80.4
>10	134	65	69		73.1
5~10	73	45	28		74.0
Clinical stage group				0.0749		0.5689
Stage 0	4	0	4		75.0
Stage 1	141	79	62		80.1
Stage 2	73	47	26		71.2
Stage 3	131	69	62		77.1
Stage 4	82	50	32		72.0
Pathologic stage group				0.2540		0.0514
Stage 0	2	2	0		100.0
Stage 1	215	112	103		82.3
Stage 2	92	52	40		75.0
Stage 3	31	15	16		74.2
Stage 4	58	24	34		67.2
Clinical tumor size				0.3967		0.0004
<2 cm	162	100	62		87.0
2~4 cm	244	134	110		71.3
>4 cm	33	19	14		66.7
Pathology tumor size				0.4417		0.0141
<2 cm	197	114	83		81.7
2~4 cm	183	94	89		69.4
>4 cm	25	14	11		72.0
OP group^*4				<0.0001		<0.0001
01	385	238	147		81.6
02	27	14	13		66.7
03	81	19	62		61.7

^*1 P value for the comparison of the survival between >36 and <36 months groups.
^*2 P value for 5-year survival among the items of the same characteristics group.
^*3 0: unknown primary site or the shape of the organ is not paired; 1: the primary site is originated from the right side; 2: the primary site is originated from the left side; 3: only one side is invaded but it is not clear which side (R′t or L′t) it is originated from; 4: both sides are invaded but the origin of the primary site is not clear and the chart record describes only one primary site.
^*4 OP group for 01: OP only; 02: OP→IA; 03: OP→CT, OP→CT + IV, OP→CT→RT, OP→IA→RT, OP→IV, OP→IV→RT, OP→RT, OP→RT + CT, OP→RT + IV, OP→RT→CT, OP→RT→IA, OP→RT→IV. Symbols: OP: operation; IA: intraarterial chemotherapy; CT: oral chemotherapy; IV: intravenous chemotherapy; RT: radiotherapy; →: then.

3.1.2. Subsites and Survival

As shown in Table 1, the site distribution of the 493 oral cancer patients showed that the commonly affected sites include the cheek mucosa, gum, tongue, and retromolar trigone. Postsurgical organ function and cosmetics may vary with surgical site, but no difference in survival could be found.

3.1.3. Laterality and Survival

As shown in Table 1, laterality is recorded in the database of cancer registries and is a mixed expression of clinical/pathological tumor size and location. It does not play a significant role in the surgical group.

3.1.4. Grade and Survival

As shown in Table 1, comparison of the pathological characteristics between >5-year (n = 271) and <5-year survival (n = 222) revealed better treatment outcomes for low grade tumors (P = 0.0006), suggesting that well-differentiated tumors are less aggressive and thus are associated with better overall survival.

3.1.5. Regional Lymph Nodes and Survival

As shown in Table 1, the number of regional lymph nodes examined might reflect the thoroughness and quality of surgical resection. However, the number of examined lymph nodes was not found to have an effect on survival. This might be due to cross-interaction between clinical lymph node stages and overall survival.

3.1.6. Clinical Stages, Pathology Stages, Clinical/Pathology Tumor Sizes, and Survival

As shown in Table 1, neither clinical nor pathological stages were found to have an impact on 5-year survival. There might be some influencing factors between low- and high-tumor stages which cannot be simply explained by surgery. However, for clinical/pathological tumor size alone, significant differences between >5-year and <5-year groups are found (P = 0.0004 and P = 0.0141, respectively). A smaller tumor size means less tumor burden and less infiltration of surrounding tissue, which may explain the improved overall outcomes.

3.1.7. Surgical Modalities and Survival

As shown in Table 1, treatment modalities (OP) were further differentiated into 3 groups based on different adjuvant therapies, that is, surgery alone, surgery plus intra-arterial chemotherapy, and surgery plus concomitant chemoradiotherapy. Significant differences between groups were found (P < 0.0001), and further analysis of surgical modalities based on the clinical/pathological stages could produce interesting insights.

This hospital-based study followed nearly 500 patients with oral squamous cell carcinoma after surgical treatment. Results showed that age of onset and laterality of tumor location did not influence the treatment outcome. The latter might be attributed to oral cancer being a less multifocal or multicentric disease than, for example, breast cancer and, hence, laterality of the primary tumor has less influence on survival. These findings are in line with previous findings [19, 20].

Advanced tumor stage or failure of locoregional control negatively influences survival in patients with OSCC [21]. However, we did not observe a significant influence from either clinical or pathological tumor stages. Similarly, Pandey et al. reported no difference in survival rates for the extent of tumor [22]; the lack of an observed difference might be because tumors of all stages were pooled in the analysis.

In the present study, multimodality treatment proved to be a prognostic factor. Benefit from systemic or adjuvant local therapies might correlate with disease biology as the grade of tumor differentiation was also an important influencing factor.

3.2. Data Mining Results Using Apriori Algorithm

Table 2 shows the best rules for OP > 36 months. The body X and head Y form an association rule X⇒Y. In a class association rule, the head Y is restricted to one attribute-value pair, and the attribute of this pair is thus the class attribute. The resulting rules can be evaluated according to four metrics: confidence, lift, leverage, and conviction. Lift (or improvement) is computed as the confidence of the rule divided by the support of the right-hand side (RHS); a minimum lift of 1.5 was required here. Lift is thus the ratio of the probability that X and Y occur together to the product of the two individual probabilities of X and Y; that is,

$$\mathrm{lift}(X \Rightarrow Y) = \frac{P(X, Y)}{P(X)\,P(Y)}. \tag{1}$$

Table 2. Ranking of the top 10 best rules found in survival larger than 36 months.

Body^*1	No. (body)	Head^*1	No. (body and head)	Confidence	Lift^*2	Leverage	Conviction
Grade/differentiation = 2, Clinical stage group = early	49	Primary site = tongue, Group = OP	27	0.55	1.91	0.05	1.52
Primary site = tongue, Group = OP	78	Grade/differentiation = 2, Clinical stage group = early	27	0.35	1.91	0.05	1.23
Primary site = tongue, Clinical stage group = early	70	Grade/differentiation = 2, Group = OP	27	0.39	1.90	0.05	1.27
Grade/differentiation = 2, Group = OP	55	Primary site = tongue, Clinical stage group = early	27	0.49	1.90	0.05	1.41
Grade/differentiation = 2	60	Primary site = tongue, Clinical stage group = early, Group = OP	27	0.45	1.88	0.05	1.34
Primary site = tongue, Clinical stage group = early, Group = OP	65	Grade/differentiation = 2	27	0.42	1.88	0.05	1.30
Primary site = tongue	88	Grade/differentiation = 2, Clinical stage group = early, Group = OP	27	0.31	1.81	0.04	1.18
Grade/differentiation = 2, Clinical stage group = early, Group = OP	46	Primary site = tongue	27	0.59	1.81	0.04	1.55
Grade/differentiation = 2	60	Primary site = tongue, Clinical stage group = early	27	0.45	1.74	0.04	1.31
Primary site = tongue, Clinical stage group = early	70	Grade/differentiation = 2	27	0.39	1.74	0.04	1.24

^*1 Stages 0 to 3 of clinical stage group and pathologic stage group as shown in Table 1 are regarded as early and stage 4 is regarded as late stage in Table 2.
^*2 The best rules with lift >1.5 are shown here.

If lift is 1, X and Y are independent. The higher the lift is above 1, the more likely it is that the co-occurrence of X and Y in a transaction is due to a relationship between them and not just random occurrence. Unlike lift, leverage measures the difference between the probability of co-occurrence of X and Y and the product of the individual probabilities of X and Y; that is,

$$\mathrm{leverage}(X \Rightarrow Y) = P(X, Y) - P(X)\,P(Y). \tag{2}$$

Leverage measures the proportion of additional cases covered by both X and Y above those expected if X and Y were independent of each other. Thus, values above 0 are desirable for leverage, whereas values greater than 1 are desirable for lift. Finally, conviction is similar to lift, but it measures the effect of the right-hand side not being true and also inverts the ratio. With ¬Y denoting that Y does not hold, conviction is measured as

$$\mathrm{conviction}(X \Rightarrow Y) = \frac{P(X)\,P(\neg Y)}{P(X, \neg Y)}. \tag{3}$$

Table 2 shows that the itemset “grade/differentiation = 2 and clinical stage group = early” (the body) is associated with the itemset “primary site = tongue and group = OP” (the head). A total of 49 patients have grade/differentiation = 2 and clinical stage group = early, and 27 of these 49 patients also fulfill “primary site = tongue and group = OP.” The confidence is the proportion of patients with “primary site = tongue and group = OP” among those with “grade/differentiation = 2 and clinical stage group = early,” that is, 27/49 = 0.55. The lift is 1.91, meaning that the co-occurrence of the body and the head in a transaction is not just a random occurrence. The leverage value of 0.05 means that the proportion of cases covered by both the body and the head is greater than would be expected if the two were independent of each other. The conviction value of 1.52, being greater than 1, indicates that the body occurs without the head less often than would be expected under independence.
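The metrics of the first rule in Table 2 can be checked with the short Python sketch below. The function `rule_metrics` is hypothetical. The body count (49) and the joint count (27) are read from the first row of Table 2 and the head count (78) from the second row; the total of 271 transactions is an assumption, taken here to be the number of patients in the >36-month group of Table 1, since the source does not state the denominator explicitly. Conviction follows the textbook definition P(X)P(¬Y)/P(X, ¬Y).

```python
def rule_metrics(n_total, n_body, n_head, n_both):
    """Confidence, lift, leverage, and conviction of body => head from raw counts."""
    p_x, p_y, p_xy = n_body / n_total, n_head / n_total, n_both / n_total
    confidence = p_xy / p_x
    lift = p_xy / (p_x * p_y)
    leverage = p_xy - p_x * p_y
    p_x_not_y = p_x - p_xy  # probability that the body holds but the head does not
    conviction = p_x * (1 - p_y) / p_x_not_y if p_x_not_y > 0 else float("inf")
    return confidence, lift, leverage, conviction

# n_total = 271 is an assumption (size of the >36-month group in Table 1).
conf, lift, lev, conv = rule_metrics(n_total=271, n_body=49, n_head=78, n_both=27)
print(round(conf, 2), round(lift, 2), round(lev, 2), round(conv, 2))
```

Under this assumption the confidence (0.55), lift (1.91), and leverage (0.05) agree with Table 2. The textbook conviction comes out at about 1.59, somewhat higher than the 1.52 listed in the table, which suggests that the mining software used a slightly different variant of this metric; the source does not specify it.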

From the top down in Table 2, the lift values gradually decrease but still show a high correlation between the body/head and survival of >36 months. When the lift value of the items listed in the “body” and “head” of Table 2 is high, there is less chance of misinterpreting the relationships between the items. In the top 8 results, the same four items, namely grade/differentiation = 2, clinical stage group = early, primary site = tongue, and group = OP, appear in every rule and are merely distributed differently between the “body” and the “head”. These data suggest that early stage tongue cancer with moderate differentiation will have a better survival (>36 months) with varied surgical approaches, where the OP comprises three kinds of treatments.

In results 9 and 10, however, only three items are included, without group = OP, and the lift value decreases to 1.74. Rules that omit “group = OP” are thus less strongly correlated than the top 8 results. This implies that the OP plays an important role in creating a correlation with improved survival (>36 months). In clinical settings, this might be due to the good treatment outcome which often accompanies surgery.

Accordingly, the Apriori algorithm as applied here is a relatively simple form of rule-based computation to identify potential rules involving various factors, such as grade/differentiation = 2, clinical stage group = early, primary site = tongue, and group = OP. The algorithm can reveal the combination effect of these factors on the outcome of OSCC therapy.

4. Conclusion

This hospital-based analysis reviewed 493 patients with OSCC to mine survival factors in operation-centric patients. The results identify the importance of grade/differentiation = 2, clinical stage group = early, primary site = tongue, and group = OP in predicting higher survival for OSCC patients.

Conflict of Interests

The authors have no conflict of interests to declare.

Acknowledgments

This work was partly supported by the National Science Council in Taiwan (under Grant no. NSC101-2320-B-037-049, NSC101-2622-E-151-027-CC3, and NSC102-2221-E-151-024-MY3), by the Department of Health (DOH102-TD-C-111-002), and by NSYSU-KMU Joint Research Project (NSYSUKMU 102-034).

References

1 Ko Y. C., Huang Y. L., Lee C. H., Chen M. J., Lin L. M., and Tsai C. C., Betel quid chewing, cigarette smoking and alcohol consumption related to oral cancer in Taiwan, Journal of Oral Pathology and Medicine. (1995) 24, no. 10, 450–453, 2-s2.0-0029394950.
2 Ko Y.-C., Chiang T.-A., Chang S.-J., and Hsieh S.-F., Prevalence of betel quid chewing habit in Taiwan and related sociodemographic factors, Journal of Oral Pathology and Medicine. (1992) 21, no. 6, 261–264, 2-s2.0-0026709264.
3 Yang M.-S., Su I.-H., Wen J.-K., and Ko Y.-C., Prevalence and related risk factors of betel quid chewing by adolescent students in southern Taiwan, Journal of Oral Pathology and Medicine. (1996) 25, no. 2, 69–71, 2-s2.0-0030088076.
4 Lin C.-F., Wang J.-D., Chen P.-H., Chang S.-J., Yang Y.-H., and Ko Y.-C., Predictors of betel quid chewing behavior and cessation patterns in Taiwan aborigines, BMC Public Health. (2006) 6, article 271, 2-s2.0-33751190736, https://doi.org/10.1186/1471-2458-6-271.
5 Chien M.-H., Ying T.-H., Hsieh Y.-H., Lin C.-H., Shih C.-H., Wei L.-H., and Yang S.-F., Tumor-associated carbonic anhydrase XII is linked to the growth of primary oral squamous cell carcinoma and its poor prognosis, Oral Oncology. (2012) 48, no. 5, 417–423, 2-s2.0-84860001243, https://doi.org/10.1016/j.oraloncology.2011.11.015.
6 Markopoulos A. K., Current aspects on oral squamous cell carcinoma, The Open Dentistry Journal. (2012) 6, 126–130.
7 van der Waal I., de Bree R., Brakenhoff R., and Coebergh J.-W., Early diagnosis in primary oral cancer: is it possible?, Medicina Oral, Patología Oral y Cirugía Bucal. (2011) 16, no. 3, e300–e305, 2-s2.0-79957938816, https://doi.org/10.4317/medoral.16.e300.
8 Yen C.-Y., Chen C.-H., Chang C.-H., Tseng H.-F., Liu S.-Y., Chuang L.-Y., Wen C.-H., and Chang H.-W., Matrix metalloproteinases (MMP) 1 and MMP10 but not MMP12 are potential oral cancer markers, Biomarkers. (2009) 14, no. 4, 244–249, 2-s2.0-68549127131, https://doi.org/10.1080/13547500902829375.
9 Yen C. Y., Huang C. Y., Hou M. F. et al., Evaluating the performance of fibronectin 1 (FN1), integrin alpha4beta1 (ITGA4), syndecan-2 (SDC2), and glycoprotein CD44 as the potential biomarkers of oral squamous cell carcinoma (OSCC), Biomarkers. (2013) 18, no. 1, 63–72, https://doi.org/10.3109/1354750X.2012.737025.
10 Lee C. H., Yen C. Y., Liu S. Y. et al., Axl is a prognostic marker in oral squamous cell carcinoma, Annals of Surgical Oncology. (2012) 19, no. supplement 3, S500–S508, https://doi.org/10.1245/s10434-011-1985-8.
11 Yen C.-Y., Liu S.-Y., Chen C.-H., Tseng H.-F., Chuang L.-Y., Yang C.-H., Lin Y.-C., Wen C.-H., Chiang W.-F., Ho C.-H., Chen H.-C., Wang S.-T., Lin C.-W., and Chang H.-W., Combinational polymorphisms of four DNA repair genes XRCC1, XRCC2, XRCC3, and XRCC4 and their association with oral cancer in Taiwan, Journal of Oral Pathology and Medicine. (2008) 37, no. 5, 271–277, 2-s2.0-42149086090, https://doi.org/10.1111/j.1600-0714.2007.00608.x.
12 Lo W.-L., Kao S.-Y., Chi L.-Y., Wong Y.-K., and Chang R. C.-S., Outcomes of oral squamous cell carcinoma in Taiwan after surgical therapy: factors affecting survival, Journal of Oral and Maxillofacial Surgery. (2003) 61, no. 7, 751–758, 2-s2.0-0037903012, https://doi.org/10.1016/S0278-2391(03)00149-6.
13 Brown J. S., Shaw R. J., Bekiroglu F., and Rogers S. N., Systematic review of the current evidence in the use of postoperative radiotherapy for oral squamous cell carcinoma, British Journal of Oral and Maxillofacial Surgery. (2012) 50, no. 6, 481–489, 2-s2.0-83655181189, https://doi.org/10.1016/j.bjoms.2011.08.014.
14 Zhu M., Zhang Z., Hirdes J. P., and Stolee P., Using machine learning algorithms to guide rehabilitation planning for home care clients, BMC Medical Informatics and Decision Making. (2007) 7, article 41, 2-s2.0-38949083897, https://doi.org/10.1186/1472-6947-7-41.
15 Toussi M., Lamy J.-B., Le Toumelin P., and Venot A., Using data mining techniques to explore physicians′ therapeutic decisions when clinical guidelines do not provide recommendations: methods and example for type 2 diabetes, BMC Medical Informatics and Decision Making. (2009) 9, article 28, 2-s2.0-67651162322, https://doi.org/10.1186/1472-6947-9-28.
16 Hu Y. J., Ku T. H., Jan R. H., Wang K., Tseng Y. C., and Yang S. F., Decision tree-based learning to predict patient controlled analgesia consumption and readjustment, BMC Medical Informatics and Decision Making. (2012) 12, article 131, https://doi.org/10.1186/1472-6947-12-131.
17 Hwang W., Hwang Y., Lee S., and Lee D., Rule-based multi-scale simulation for drug effect pathway analysis, BMC Medical Informatics and Decision Making. (2013) 13, no. supplement 1.
18 Agrawal R. and Srikant R., Fast algorithms for mining association rules, Proceedings of the 20th International Conference on Very Large Data Bases (VLDB ′94), 1994.
19 Yerushalmi R., Kennecke H., Woods R., Olivotto I. A., Speers C., and Gelmon K. A., Does multicentric/multifocal breast cancer differ from unifocal breast cancer? an analysis of survival and contralateral breast cancer incidence, Breast Cancer Research and Treatment. (2009) 117, no. 2, 365–370, 2-s2.0-68949106080, https://doi.org/10.1007/s10549-008-0265-1.
20 Jan J.-C., Hsu W.-H., Liu S.-A., Wong Y.-K., Poon C.-K., Jiang R.-S., Jan J.-S., and Chen I.-F., Prognostic factors in patients with buccal squamous cell carcinoma: 10-year experience, Journal of Oral and Maxillofacial Surgery. (2011) 69, no. 2, 396–404, 2-s2.0-78751504446, https://doi.org/10.1016/j.joms.2010.05.017.
21 Cooper J. S., Pajak T. F., Forastiere A. A., Jacobs J., Campbell B. H., Saxman S. B., Kish J. A., Kim H. E., Cmelak A. J., Rotman M., Machtay M., Ensley J. F., Chao C., Schultz C. J., Lee N., and Fu K. K., Postoperative concurrent radiotherapy and chemotherapy for high-risk squamous-cell carcinoma of the head and neck, The New England Journal of Medicine. (2004) 350, no. 19, 1937–1944, 2-s2.0-2342517421, https://doi.org/10.1056/NEJMoa032646.
22 Pandey M., Bindu R., and Soumithran C. S., Results of primary versus salvage surgery in carcinoma of the buccal mucosa, European Journal of Surgical Oncology. (2009) 35, no. 4, 362–367, 2-s2.0-60849096961, https://doi.org/10.1016/j.ejso.2008.02.008.
