Non-Audit Service Fees and Financial Reporting Quality: A Meta-Analysis
I thank two anonymous reviewers for many helpful suggestions. I also thank Pauline Weetman, Bradley Pomeroy and seminar participants at the Auckland University of Technology for helpful comments.
Abstract
Auditing as a corporate governance mechanism has attracted considerable research attention. Because of the information asymmetry between corporate managers and outside shareholders, auditors are hired to provide independent assurance that financial statements are prepared following generally accepted accounting principles. The credibility of such assurance depends on the independence, both in fact and in appearance, of the auditor. Over the years, however, the independence of auditors has come under increased scrutiny because of their joint provision of both audit and non-audit services. A sizable literature on the impact of non-audit fees on financial reporting quality has developed. The evidence from this literature, however, remains inconclusive. This paper provides a meta-analysis of the available literature by assessing (a) the net effect of non-audit fees on financial reporting quality, and (b) whether there is homogeneity in the financial reporting quality proxies used in the extant literature. Findings suggest that the level of client-specific non-audit fees is associated with reduced financial reporting quality. However, the underlying studies used to conduct this meta-analysis are not homogenous.
The purpose of this paper is to present a meta-analysis of the association between fees for non-audit services or non-audit fees (hereafter NAF) and financial reporting quality (hereafter FRQ). The well-known agency problem between shareholders and managers gives rise to the hiring of auditors who provide independent assurance to corporate stakeholders that financial statements prepared by corporate managers comply with generally accepted accounting principles (GAAP) (Watts and Zimmerman, 1983). Auditing also plays a significant role in enforcing and protecting investors' rights by detecting expropriation by insiders (Newman et al., 2005) and benefits management by providing a signalling mechanism regarding the reliability of management-provided financial information. However, the discharge of such responsibilities requires auditors to be independent, both in fact and in appearance. An auditor who is independent in fact has the ability to make independent audit decisions despite the perceived lack of independence or if the auditor is placed in a potentially compromising position. Independence in fact, however, does not necessarily imply that auditors are perceived as independent by the stakeholders (appearance of objectivity).
Although it is difficult to prove independence in fact, it is possible using large samples to determine the association between NAF and some accounting outcomes, for example, discretionary accruals, earnings conservatism to assess auditor independence (DeFond et al., 2002; Frankel et al., 2002; Ashbaugh et al., 2003; Chung and Kallapur, 2003). With respect to independence in appearance, archival research investigates the association between NAF and market valuation of accounting earnings to assess whether independence in appearance has been impaired due to the joint provision of audit and non-audit services (hereafter NAS) (Krishnan et al., 2005; Francis and Ke, 2006; Ghosh et al., 2009).
The idea that high NAF may have a detrimental effect on the FRQ is rooted in the theoretical model developed by DeAngelo (1981) who relates auditor independence with client-specific future quasi rents defined as ‘the excess of revenues over avoidable costs, including the opportunity cost of auditing the next-best alternative client’ (italics in original, p.116). She develops a two-dimensional definition of audit quality that sets the standard for addressing auditor independence issue. An auditor will be considered independent if he/she can detect a material misstatement and then report it to the audit service consumers. If no client-specific quasi-rents are expected from a given client relationship then the auditor should have no economic incentive to conceal financial misstatements and therefore should be considered perfectly independent from the client. However, if the client-specific future quasi rents are high then auditors may compromise independence to retain such quasi rents. It has been argued that NAS provides more quasi rents than do audit services because of the higher margin derived from the former and therefore is more likely to impair auditor independence (Securities and Exchange Commission [hereafter SEC], 2000), particularly independence in appearance (Lindberg and Beck, 2004).1
The provision of NAS, however, provides considerable economies of scope. These economies of scope are broadly categorized into knowledge spillover benefits (benefits from transferring information and knowledge), and contractual economies (making better use of assets and/or safeguards already developed when contracting and ensuring quality in auditing) (Simunic, 1984; Beck et al., 1988; Arrunada, 1999). Both impairment of independence and economies of scope could co-exist although studies examined in this meta-analysis test for the effect of high NAF on impairment of independence irrespective of whether economies of scope is achieved. Narrative literature reviews on the consequences of NAS generally conclude that extant archival research on the effect of NAF on FRQ provides mixed evidence (Beattie and Fearnley, 2002; Schneider et al., 2006; Anandarajan et al., 2010).
Although narrative literature reviews may include a large number of studies on particular research themes, such reviews can be misleading and often inconclusive (Hunter and Schmidt, 1990). In some cases there may be several studies with varying results that are subject to variations in sample size, time period, and setting of the study. As a result, different researchers may reach different conclusions about a set of individual studies. A narrative literature review will report these as apparently inconsistent results and call for further research, which may also produce inconsistent results and further cloud the issue. By contrast, meta-analysis statistically aggregates results across individual studies and corrects for statistical artefacts like sampling and measurement error and, thereby, provides much greater precision with respect to the findings compared with narrative reviews (Hay et al., 2006). Meta-analysis is particularly effective in reconciling results that are inconsistent across studies like the effect of NAF on FRQ.
This meta-analysis is expected to contribute to the debate concerning whether regulation restricting the provision of NAS is justifiable. The recent most comprehensive corporate governance law in the form of the Sarbanes-Oxley Act 2002 (hereafter SOX-2002) prohibited nine specific NAS. The drafters of this regulation relied on an important piece of academic research conducted by Frankel et al. (2002), who document that firm-level absolute discretionary accruals and the probability of meeting and or beating analyst forecasts is positively associated with the level of NAF.2 However, subsequent studies using different research methodologies fail to support Frankel et al.'s findings (Ashbaugh et al., 2003; Chung and Kallapur, 2003). These contradictory findings call for an overall assessment of the state of the current research on the association between NAF and FRQ. Such an assessment is expected to inform regulators who are in the process of considering whether to impose sanctions on the provision of NAS. Francis (2006, p. 748) notes, ‘it is clear that the appropriateness of auditor-provided NAS [non-audit services] continues to be controversial and viewed with skepticism by regulators. For this reason, it is important that there be ongoing research to facilitate well informed policy making by regulators with respect to the costs and benefits of restricting the scope of NAS to audit clients’ (italics added).
Prospective investors can use reliable meta-analysis results to assess the risk of making investment decisions based on accounting information audited by auditors who may have compromised their independence and, hence, the integrity of the financial statements because of the joint provision of audit and NAS. The importance of conducting meta-analysis is further justified because there has been less such research in the accounting discipline. Pomeroy and Thornton (2008) identify 14 previous meta-analyses in the accounting discipline on internal control judgment (Trotman and Wood, 1991), accountants' job satisfaction (Brierley, 1999), corporate characteristics and disclosure levels (Ahmed and Courtis, 1999), corporate social and financial performance (Orlitzky et al., 2003), and audit fees (Hay et al., 2006).
A recent meta-analysis by Lin and Hwang (2010) integrates results from 48 prior studies on the determinants of earnings management, and provides evidence that NAF is positively associated with corporate earnings management based on a sub-set of ten studies. This meta-analysis differs from Lin and Hwang in three important respects. First, this meta-analysis includes eight studies that provide evidence on the association between NAF and market perception of accounting earnings. This is particularly important because there is evidence to suggest that NAF is negatively associated with earnings response coefficients (hereafter ERC) although the evidence on auditor independence in fact is rather mixed (Beattie and Fearnley, 2002). Lin and Hwang (2010) do not include these studies because their dependent variable is some form of earnings management measures. Second, this study broadens the definition of FRQ by incorporating auditor reporting decisions as an important construct of the FRQ. The literature on the association between NAF and reporting decisions has produced very mixed evidence and is a good testing ground for meta-analysis. Third, the sample studies selected for this meta-analysis significantly outnumber Lin and Hwang's sample (45 versus 10 studies, respectively).
Following the meta-analysis procedures outlined in Lipsey and Wilson (2001), this study provides statistical evidence that client-provided NAF is associated with reduced FRQ, as is evident from an average effect size (hereafter ES) of 0.021 which is statistically significant at better than the 1% level. Further analysis decomposing the FRQ proxies into individual components reveals that earnings management increases and auditor's propensity to issue qualified audit opinion decreases with the level of NAF indicating impairment of independence in fact. Importantly, meta-analysis of the market-based studies reveals that ERC is negatively associated with the level of NAF, indicating a lack of independence in appearance because of the joint provision of audit and NAS.
1: NAS FEES AND AUDITOR INDEPENDENCE
An important theoretical rationale for the impairment of auditor independence caused by joint provision of audit and NAS was provided by DeAngelo (1981). She coined the term ‘quasi rent’ in the context of audit fee-setting and used this concept to theorize auditor independence. Incumbent auditors enjoy a comparative cost advantage over new auditors because the latter group must incur technological start-up costs with each audit. Clients also incur auditor switching costs. Both the technological and switching cost advantages allow the incumbent auditor to set the audit fees in a manner that generates future quasi rents for the incumbent auditors. Both the client and the incumbent auditor can impose real costs on the other. For example, the auditor could demand higher audit fees and the client could ask for clean audit opinion. The existence of future quasi rents, therefore, weakens the auditors' incentives to be independent (DeAngelo, 1981). Subsequent empirical research relied on this theoretical notion to test the effect of this quasi rent in the form of NAF on FRQ.
However, opponents of DeAngelo's (1981) theory argue that the provision of NAS actually increases audit efficiency. This argument is derived from the conjecture that providing both audit and NAS provides considerable economies of scope. These economies of scope are broadly categorized into knowledge spillover benefits (benefits from transferring information and knowledge) and contractual economies (making better use of assets and/or safeguards already developed when contracting and ensuring quality in auditing) (Simunic, 1984; Beck et al., 1988., Arrunada, 1999; Beattie and Fearnley, 2002). Simunic (1984) interprets the efficiencies from joint production of audit and NAS as follows:
while efficiencies from joint production may exist, this does not imply that joint performance of MAS and auditing is necessarily desirable. Efficiencies can be partially appropriated as rents to the CPA firm supplier, and hence can themselves create a threat to independence. The degree of competition among CPA firms is therefore a critical factor in the problem. (p. 681)
Beck et al. (1988) distinguish NAS into recurring and non-recurring services and propose that recurring NAS give rise to knowledge spillovers and reduce the threat to independence, while the opposite effect is experienced with non-recurring NAS. Another argument for non-impairment of independence is advanced by Arrunada (1999), who suggests that the provision of NAS can also increase the auditor's investment in reputational capital, which the auditor is not likely to jeopardize to satisfy the demand of one particular client (Arrunada, 1999).
A number of narrative literature reviews have summarized the current state of archival research on the effect of NAF on FRQ. For example, Beattie and Fearnley (2002) and Schneider et al. (2006) review empirical studies of (a) the nature and magnitude of NAS fees; (b) the determinants of NAS purchasing decisions; (c) the impact of joint services provision from survey and experimental studies; (d) the association between joint provision, pricing and audit tenure; (e) the impact of NAS on auditor reporting decisions; and (f) the association between NAF and earnings quality. A brief summary of the review literature follows.
Beattie and Fearnley (2002) conclude from a comprehensive literature review that there is very little clear support for the view that joint provision of audit and NAS impairs independence in fact. They attribute this mixed evidence to the measurement of the proxy variable for auditor independence and the validity of the proxy itself. However, there is evidence to suggest that joint provision of audit and NAS adversely affects user perceptions. With respect to NAF and earnings quality, Schneider et al. (2006) conclude that extant archival studies provide some evidence consistent with NAF reducing earnings quality, but the findings are limited to specific circumstances and hence may not be generalizable. For audit opinion decisions, the authors conclude that U.S. studies provides little evidence that NAS reduces auditor independence. Evidence from Australia and the U.K. is, however, mixed and the differences in sample composition, reporting environment and the type of NAS provided may be responsible for these mixed findings. Anandarajan et al. (2010) conclude that existing research has provided mixed evidence on whether the provision of NAS impairs auditor independence. The authors raise concerns on the findings of the extant research since the research has unambiguously used NAS as the surrogate for impairment of independence. Survey evidence seems to suggest that a higher level of NAS is not necessarily perceived as causing impairment of independence (Dopuch et al., 2003). Ikin (2005) concludes that archival research provides no convincing evidence that high level of NAS impairs auditor independence in fact, as measured by auditor tenure, earnings management or earnings conservatism. However, the perception-based academic evidence seems to suggest that financial statement users perceive that NAS impairs auditor independence.
Figure 1 provides a schematic representation of the consequences of NAS categorized primarily into knowledge spillover and impairment of independence effects.3 Researchers have used three primary research methodologies, namely survey-based, experimental and large-sample archival research approaches to investigate the consequences of NAS.4 Archival research methodology has the advantage of‘incorporating all the richness of a capital market and its participants’ actual behaviour; however, a general weakness of archival studies is that the richness of the environment frequently makes it difficult, if not impossible, to pinpoint specific characteristics or situations' (Schneider et al., 2006, p. 205). It is evident from Figure 1 that the FRQ proxies used in the archival research are quite heterogeneous. Similarly, there is also disagreement among researchers regarding which NAF proxy better captures the economic dependence of incumbent auditors on the clients. The variables most frequently used to measure the importance of NAS to the auditing firm are the fee ratio (the ratio of NAF to total fees) and total fees (the sum of NAF and audit fees) paid to the external auditor; others include fee measures that adjust the amounts by client to construct a proxy for the client's importance to the auditor and percentile ranks, by auditor, of a firm's non-audit and audit fees.

DOMAIN OF META-ANALYSIS
Archival research uses an agency theory explanation for investigating the effect of NAF on FRQ and argues that auditors are rational wealth maximizers who would be intentionally biased towards compromising audit quality in order to generate wealth for themselves. Behavioural literature, on the other hand, suggests that psychological heuristics unconsciously lead auditors to make biased judgments. Although the cause of auditor bias differs in these two streams of literature, the ultimate effect remains the same: ‘auditors are more likely to acquiesce to client pressure, including pressure to allow earnings management, when the provision of nonaudit services generates economic rents’ (Frankel et al., 2002, p. 75). Frankel et al. (2002) report a positive association between NAF and earnings management consistent with this hypothesis. Their finding, however, has been challenged by at least three other subsequent papers (Ashbaugh et al., 2003; Chung and Kallapur, 2003; Reynolds et al., 2004). Francis (2004) offers a possible explanation for this stream of replicated research:
The Frankel et al. (2002) paper hints at some political dimensions to accounting research. The accounting establishment was upset by the Frankel et al. study, and I believe there was some sympathy within the academic community to publish papers refuting their findings. Top-level accounting research journals do not generally publish replications or ‘no results’ studies, yet that is what has occurred in the non-audit service areas. (p. 357)
Another FRQ proxy that received significant research attention is the impact of NAF on audit reporting decisions. If the joint provision of audit and NAS impairs auditor independence then it is likely that auditors will issue far more unqualified audit reports than warranted. In an early study, Wines (1994) reports that NAF is negatively associated with audit qualification. Barkess and Simnett (1994) and Craswell (1999), however, fail to find any significant association between the two. But Sharma and Sidhu (2001) support Wines (1994) by focusing on a sample of distressed firms. Researchers have also used earnings conservatism, restatements, and market valuation of accounting earnings as useful FRQ proxies. The following section develops testable hypotheses on the association between NAF and FRQ proxies and attempts to provide a critical evaluation of the FRQ proxies.
2: DEVELOPMENT OF TESTABLE HYPOTHESES
Although FRQ occupies the central role in the information provision to outsiders, the precise definition of FRQ remains elusive. In considering FRQ, the overwhelming majority of research has focused on earnings quality. Earnings are widely used as a key performance indicator of business success, commonly employed in compensation and debt arrangements. A recent comprehensive survey of chief financial officers by Graham et al. (2005) show the GAAP earnings number, especially the earnings per share (EPS), is the key metric upon which the market focuses. This is mainly because, for evaluating a firm's performance, investors need a simple benchmark, like EPS, that reduces the costs of information processing (Graham et al., 2005, p. 21). Also, valuation theory has long posited a relationship between earnings and the value of common stock (Graham et al., 1962; Miller and Modigliani 1966). The most widely used earnings quality measure is some form of abnormal accruals measure as estimated by popular models like Jones (1991) and extended by Dechow et al. (1995). Following the agency-theory relationship developed earlier, auditors are more likely to acquiesce to client pressure, including pressure to allow earnings management, when the provision of NASs generates economic rents.5 Therefore, the following hypothesis is formulated:
-
H1: The level of absolute abnormal accruals is positively associated with the level of NAF.6
However the reliability of the abnormal accruals models used in the literature is questionable. Unfortunately there is no best accruals model available and researchers usually validate the robustness of the findings by using alternative accruals models (Dechow et al., 2003). Another major problem with using abnormal accruals as a proxy for opportunistic earnings management is that managers could also use discretionary accruals as an efficient contracting device to provide credible signals to the marketplace about the future growth prospects of the organizations. However, earnings management-based accounting literature generally theorizes opportunistic motives for using abnormal accruals. Furthermore, managers could manipulate accounting information using a combination of accruals management, real activities manipulation (Roychowdhury, 2006), and classification shifting mechanisms (McVay, 2006).
Auditor provided qualified audit opinion has been used as another proxy for the FRQ. Unlike earnings management research which reflects actual reported numbers, qualified audit opinion is provided by the auditors by taking into consideration a large number of factors including quality of earnings before issuing such opinions. Issuance of such a qualified opinion is a matter of considerable importance to the auditors because such opinions could lead to company bankruptcy (the so-called self-fulfilling prophecy feature). The literature on the association between NAF and auditors' propensity to issue qualified audit opinions provides very mixed evidence. For example, two early Australian studies (Wines, 1994; Sharma and Shidhu, 2001), report a negative association between the two whereas Barkess and Simnett (1994) and Craswell (1999) find no significant association between NAF and qualified audit opinion decisions. None of the U.S. studies provide any evidence of an adverse effect of NAF on qualified audit opinion decisions; if anything there seems to be evidence that provision of tax services is associated with more accurate qualified audit opinion decisions (Robinson, 2008). The following testable hypothesis is developed to assess the effect of NAF on the propensity of auditors to issue qualified audit opinions:
-
H2: The propensity to issue qualified audit opinions is negatively associated with the level of NAF.
FRQ has also been proxied by earnings conservatism and earnings restatement. Researchers use the notion of conservatism to imply conditional conservatism which requires a higher degree of verification for recognizing good news than bad news in financial statements (Basu, 1997). This timely loss recognition property mitigates agency problems between shareholders and managers (Kothari et al., 2010). The choice of conditional conservatism as an earnings quality measure is appropriate in an audit setting. Because published financial statements are a joint product of managers' representations and the audit process, earnings quality represented by the conditional conservatism property would also be expected to reflect audit quality (Ruddock et al., 2005). Because NAF are widely believed to contain higher profit margins than traditional audit services, auditors may be less inclined to issue conditionally conservative financial statements and hence compromise the integrity of financial statements with the level of NAF. Therefore the following hypothesis is formulated:
-
H3: There is a negative association between NAF and earnings conservatism.
The conditional conservatism measure, however, has been challenged from different quarters. One common argument against this measure is that it proxies for some other properties of firms and/or financial reporting because of a negative relation between the conservatism measure and the market-to-book ratio and, hence, biases the asymmetric timeliness coefficient. Another serious concern is raised by Dietrich et al. (2007) who argue that the model is mis-specified, and produces biased coefficients since the model uses earnings as the dependent variable rather than the stock returns which is conventional in earnings response coefficient research. However, Ball et al. (2009) provide an econometric analysis of the Basu (1997) conservatism measure and conclude that these challenges are misconstrued.
With respect to earnings restatement, The U.S. General Accounting Office (GAO, 2002) documents a total of 919 restatements from January 1997 to June 2002. Academic research reveals that restating firms lose about 10% of the market value on the day of such announcements (Palmrose et al., 2004). Efendi et al. (2007) report a positive association between stock-based compensation arrangements and subsequent earnings restatements. If auditors compromise their independence because of the joint provision of audit and NAS, then these earnings manipulation and/or violation of GAAP transactions will go unreported, and lead to lower quality audits and reviews. Some of these violations are likely to be revealed through restatements of financial statements (Kinney et al., 2004). The following hypothesis is, therefore, developed:
-
H4: Earnings restatements increase with the level of NAF.
The FRQ variables described above are all used in academic research to investigate the impact of NAF on the impairment of independence in fact. To examine independence in appearance, archival research has examined the market valuation of accounting earnings (conventional ERC) conditional on NAF. The underlying premise is that if investors perceive audit independence to be impaired because of NAF, ‘[they] will place less reliance on the auditors’ attestation of its client's financial statements, and perceive greater noise in the client's reported earnings' (Krishnan et al., 2005). The following hypothesis tests this conjecture:
-
H5: The ERC is negatively associated with the level of NAF.
The major problem associated with using capital market studies is the fact that the stock market is flooded with information on a continuous basis. It is difficult to control precisely for alternative information that could potentially confound the impact of NAF disclosures on stock returns.
3: METHODS
3.1: Search for Relevant Studies
I performed an exhaustive search via ABN-INFO, SSRN, existing literature reviews, and internet sources to identify potential studies to be included for this meta-analysis. One important consideration for any meta-analysis is whether unpublished working papers should be included along with published studies. I excluded working paper in the main analysis because (a) the papers have not been adequately vetted by the review process; (b) it is difficult to identify all working papers so as to eliminate the sample selection bias; or (c) unpublished papers may be subsequently published. Such exclusion, however, can result in publication bias because there is a tendency for studies with significant results or larger ES to be more likely to be published while those with insignificant results are not. This is commonly known as ‘file drawer’ problem and will require the calculation of a fail-safe number to rule out the publication bias explanation for the findings. The procedure is explained in section 3.3. The combined search resulted in 45 papers and nearly 108,000 sample observations summarized in Table 1. The bulk of the papers in Table 1 address the effect of NAF on earnings management (14 studies), qualified audit opinions and auditor ratification (16 studies) and cost of debt and market valuation of corporate earnings (ERC) studies (9 studies). Thirty-eight of these 45 studies use FEERATIO as the primary independent NAF construct vis-à-vis FRQ with a resulting sample observation of 92,511.
Author and year | Sample size | Country | Dependent variable | Independent variable | Statistics | Findings |
---|---|---|---|---|---|---|
NAF and earnings management (common proxy DACCR) | ||||||
Frankel et al. (2002) | 2,472 firm-years | U.S.A. | DACCR a | FEERATIO | 0.01 | Positive and significant |
2,012 firm-years | Meet/beat forecasts | FEERATIO | 0.01 | Positive and significant | ||
2,092 firm-years | Small earnings increase | FEERATIO | 0.72 | Negative but insignificant | ||
Ashbaugh et al. (2003) | 3,069 firm-years | U.S.A. | |DCACCR|b | FEERATIO | 0.02 | Positive and significant |
1,666 firm-years | Meet/beat forecasts | FEERATIO | 0.24 | Positive and significant | ||
2,758 firm-years | Small earnings increase | FEERATIO | 0.66 | Negative but insignificant | ||
Chung and Kallapur (2003) | 1,853 firm-years | U.S.A. | |DACCR| | FEERATIO | 1.52 | Positive but insignificant |
Larcker and Richardson (2004) | 5,103 firm-years | U.S.A. | |DACCR| | FEERATIO | 1.69 | Positive and significant |
TOTFEE | −6.21 | Negative and significant | ||||
Reynolds et al. (2004) | 2,507 companies | U.S.A. | |DACCR| | FEERATIO | 0.23 | Negative and insignificant |
TOTFEE | 0.35 | Positive but insignificant | ||||
Ferguson et al. (2004) | 610 firm-years | U.K. | |DCACCR| | FEERATIO | 0.001 | Positive and significant |
NAFEE | 0.044 | Positive and significant | ||||
Antle et al. (2006) | 2,294 firm-years | U.K. | |DACCR| | FEERATIO | 2.47 | Positive and significant |
LOGAF | 3.88 | Positive and significant | ||||
LOGNAF | −2.93 | Negative and significant | ||||
Dee et al. (2006) | 384 firms | U.S.A. | Performance adjusted |DCACCR| | FEERATIO | 0.60 | Negative and insignificant |
TOTFEE | 0.022 | Positive and significant | ||||
Huang et al. (2007) | 6,891 firm-years | U.S.A. | Performance adjusted CACCR+ | TAXRATIO c | 0.12 | Negative but insignificant |
6,722 firm-years | Meet/beat forecasts | TAXRATIO | 0.69 | Positive but insignificant | ||
3,361 firm-years | Small earnings increase | TAXRATIO | 0.37 | Negative but insignificant | ||
Habib and Islam (2007) | 530 firm-years | Bangladesh | DACCR | FEERATIO | 0.47 | Negative but insignificant |
100 firm-years | Loss avoidance | FEERATIO | −1.20 | Negative but insignificant | ||
150 firm-years | Small earnings increase | FEERATIO | 0.57 | Positive but insignificant | ||
Mitra (2007) | 71 firms (2000) | U.S.A. | |DACCR| | FEERATIO | 0.24 | Positive but insignificant |
TOTFEE | 0.31 | Positive but insignificant | ||||
Srinidhi and Gul (2007) | 4,282 firm-years | U.S.A. | DD(2002) accrual quality measure | FEERATIO | 0.01 | Positive and significant |
TOTFEE | 0.05 | Positive and significant | ||||
NAFEE | 0.01 | Positive and significant | ||||
Gul et al. (2007) | 4,720 firm-years | U.S.A. | Performance adjusted DACCR | FEERATIO | -d | Positive but insignificant |
4,720 firm-years | LTOTFEE | 0.05 | Negative and significant | |||
4,720 firm-years | LNAF | -d | Negative but insignificant | |||
Lim and Tan (2008) | 4,943 firm-years | U.S.A. | DCAACR | LANF | 1.96 | Positive and significant |
PRNAU | 1.40 | Positive but insignificant | ||||
LTOT | 0.65 | Positive but insignificant | ||||
3,498 firm-years | Meet forecasts | PRNAU | 1.26 | Positive but insignificant |
NAF and audit opinion/auditor ratification decisions | ||||||
---|---|---|---|---|---|---|
Wines (1994) | 76 listed firms | Australia | Audit qualifications | FEERATIO | −1.83 | Negative and significant |
Craswell (1999) | 1,145 firm-years | Australia | Qualified audit report = 1 | FEERATIO | 1.01 | Positive but insignificant |
Lennox (1999) | 2,266 firm-years | U.K. | Qualified audit report = 1 | FEERATIO | 2.05 | Positive and significant |
Sharma and Shidhu (2001) | 49 bankrupt firms | Australia | GC opinion = 1 | FEERATIO | 0.049 | Negative and significant |
DeFond et al. (2002) | 1,158 distressed firms | U.S.A. | GC opinion = 1 | FEERATIO | 0.49 | Negative but insignificant |
TOTFEE | 0.12 | Negative but insignificant | ||||
Firth (2002) | 1,112 firm-years | U.K. | Qualified opinion = 1 | FEERATIO | −1.96 | Negative and significant |
Geiger and Rama (2003) | 132 firms | U.S.A. | GC modified audit opinions = 1 | NASFEE | 0.215 | Negative but insignificant |
Hay et al. (2006) | 643 firm-years | New Zealand | Audit qualifications | FEERATIO | 0.52 | Negative but insignificant |
Robinson (2008) | 209 firms | U.S.A. | GC opinion = 1 | TAXRATIO | 0.05 | Positive and significant |
Fargher and Jiang (2008) | 5,113 firm-years | Australia | GC|PGC = indicator variable coded 1 if a client receives actual GC given a PGC | FEERATIO | 0.003 | Negative and significant |
Basioudis et al. (2008) | 58 firms | U.K. | GC opinion = 1 | NASFEE | 0.013 | Negative and significant |
Lim and Tan (2008) | 1,692 firm-years | U.S.A. | GC opinion = 1, zero otherwise. | LNAF | 2.33 | Positive and significant |
FEERATIO | 1.63 | |||||
LTOT | 8.31 | |||||
Callaghan et al. (2009) | 92 firms | U.S.A. | GC opinion = 1 | FEERATIO | 0.51 | Positive and insignificant |
TOTFEE | 0.83 | Positive and insignificant | ||||
Raghunandan (2003) | 172 firm-years | U.S.A. | Proportion of shares voted against and abstained from the selection of the auditor. | FEERATIO | 3.92 | Positive and significant |
Mishra et al. (2005) | 248 firms | U.S.A. | Vote: Percent of votes against auditor | TAXRATIO | 7.78 | Positive and significant |
Liu et al. (2009) | 194 firm-years | U.S.A. | Proportion of shares voted against and abstained from the selection of the auditor. | FEERATIO | 3.60 | Positive and significant |
NAF and Earnings conservatism | ||||||
---|---|---|---|---|---|---|
Ruddock et al. (2005) | 3,746 firm-years | Australia | News-dependent conservatism measure of Basu (1997) | FEERATIO | 1.23 | Positive but not significant |
Zhang and Emanuel (2008) | 352 firm-years | New Zealand | News-dependent conservatism measure of Basu (1997) | FEERATIO | −0.95 | Negative but insignificant |
NAF and Earnings restatement | ||||||
---|---|---|---|---|---|---|
Kinney et al. (2004) | 934 firm years | U.S.A. | RESTATE= 1 if restated, 0 otherwise. | TAXRATIO | 0.04 | Negative and significant |
Agrawal and Chadha (2005) | 318 firm-years | U.S.A. | RESTATE= 1 if restated, 0 otherwise. | FEERATIO | 0.15 | Positive but insignificant |
Li and Lin (2005) | 351-firm years | U.S.A. | RESTATE= 1 if restated, 0 otherwise. | FEERATIO | 0.93 | Positive but insignificant |
LOGTF | 0.002 | Positive and significant | ||||
LOGNAF | 0.66 | Positive but insignificant | ||||
Bloomfield and Shackman (2008) | 500 firm-years | U.S.A. | Restatement probability | FEERATIO | −1.86 | Negative and significant |
NAF and capital market studies | ||||||
---|---|---|---|---|---|---|
Brandon et al. (2004) | 333 bond issuers | U.S.A. | Moody's bond rating | FEERATIO | 0.023 | Negative and significant |
TOTFEE | 0.01 | Negative and significant | ||||
Dhaliwal et al. (2008) | 560 bond issues | U.S.A. | TSPREAD: the yield on the first bond issue | FEERATIO | 2.11 | Positive and significant |
TOTFEE | 2.45 | Positive and significant | ||||
Krishnan et al. (2005) | 2,816 firm-years | U.S.A. | Three-day market-adjusted abnormal returns (CAR) | FEERATIO | −3.22 | Negative and significant |
−1.68 | Negative and significant | |||||
Gul et al. (2006) | 840 firm-years | Australia | RTN: Holding returns from 9 months before the balance sheet date to 3 months after. | FEERATIO | −2.46 | Negative and significant |
Francis and Ke (2006) | 16,243 firm-years | U.S.A. | Cumulative abnormal returns (CAR) | FEERATIO | −3.96 | Negative and significant |
Chin et al. (2007) | 254 firm-years | Taiwan | Forecast bias | FEERATIO | 2.46 | FEERATIO is positively associated with optimistic and inaccurate forecasts |
Forecast accuracy | ||||||
Lim and Tan (2008) | 2,935 firm-years | U.S.A. | CAR | LANF | −2.95 | Negative and significant |
PRNAF | −2.92 | Negative and significant | ||||
LTOT | −3.16 | Negative and significant | ||||
Niu (2008) | 814 firm-years | Canada | RTN: Holding returns from 9 months before the balance sheet date to 3 months after. | FEERATIO | −0.43 | Negative but insignificant |
Ghosh et al. (2009) e | 21,762 firm-years (2001-06) | U.S.A. | CAR | FEERATIO | 1.03 f | Positive but insignificant |
TOTFEE | −2.25 | Negative and significant |
- Notes: Bold and italicized reported statistics are t-values while the remaining are p-values.
- a I use absolute DACCR (|DACCR|) as the relevant FRQ proxy because companies could engage in both upward as well as downward earnings management. In addition to the DACCR, many studies also include earnings benchmarks, e.g., ‘loss avoidance', ‘small increase in earnings’ and ‘meeting and or beating analyst forecasts’ as the other earnings management proxies. I provide a sensitivity analysis using four such earnings management studies to examine the effect of NAF on these alternative earnings management proxies.
- b Ashbaugh et al. (2003) replicate Frankel et al. (2002) but model |DACCR| as performance-adjusted DACCR.
- c Huang et al. (2007) break down FEERATIO into audit-related fee ratio (AFRATIO), tax ratio (TAXRATIO) and other fee ratio (OTHRATIO). I include only TAXRATIO as the appropriate fee construct because descriptive statistics in Huang et al. (2007) Table 1 reveals that TAXRATIO is the largest component of FEERATIO. Additionally, PCAOB rules restrict (a) auditors' involvement in certain aggressive tax-planning transactions, (b) tax services provided to a company's senior financial management, and (c) new procedures for audit committee preapproval.
- d Gul et al. (2007) do not provide any significance level for the FEERATIO and LNAF and therefore I could not calculate the relevant ES.
- e The working paper version of this study used data from 2000 to 2002 and found a negative association between market returns and earnings conditional on NAF level. The published version uses a much larger sample from a more recent data period and reports a positive but insignificant association which leads them to conclude that NAF level is not associated with impairment of independence in appearance.
- f The coefficient on %NAF/TF is negative and statistically significant at better than the 1% level (coefficient −0.005, t-statistics −3.22), but this coefficient in itself does not answer the question of whether NAF is associated with a lack of independence in appearance.
3.2: Criteria for Relevance
The studies included for this meta-analysis had the following characteristics. First, the studies examined the relationship between NAF and FRQ quantitatively. The reported statistics did not have to be a Pearson's product-moment correlation r, but could also be a t-statistic or p-value. Second, the studies unambiguously specified the component of the FRQ they were trying to measure. Lim and Tan (2008) examine earnings management, audit opinion and ERC in the same paper so there is an overlapping of samples. I treat all these sub-sample analyses as separate studies because no combined FRQ result is presented.
3.3: Meta-analysis Procedures
The purpose of this meta-analysis is to determine (a) the net effect of NAF on FRQ; and (b) whether there is homogeneity in the FRQ proxies used in the extant literature. The following standard meta-analysis procedures as suggested by Lipsey and Wilson (2001) are performed to fulfil these objectives.
- 1
Step 1: Convert test statistics, such as p-values or t-statistics reported in different studies to a standard correlation measure called effect size (ES). The majority of the studies canvassed for meta-analysis in this paper reported p-values. Reported p-values are converted into t-statistics, and ES then is calculated as follows:
where, t is the reported or converted t-statistics and df is the degree of freedom calculated as (n− 3) where n is the number of firm-years observations for each studies.()
-
Step 2: Compute mean ES using weighting factors based on assumptions regarding the homogeneity of variances across studies.
-
Step 3: Test the homogeneity assumptions in step 2 above.
While conducting this meta-analysis a number of adjustments had to be made. First, the selected studies differ with respect to the proxy variable chosen to represent auditor independence. The most widely used proxy is the fee ratio (FEERATIO) defined as NAF divided by sum of audit and NAF for a particular client. This ratio has been used in earlier NAF-related research (Scheiner, 1984; Parkash and Venable, 1993; Barkess and Simnett, 1994; Firth, 1997) and is also consistent with the SEC (2000) position that this measure is useful for investors to gauge auditor independence. Another argument for using FEERATIO as a measure to assess auditor independence ‘could be that there are only certain specific types of nonaudit services that impair an auditor's independence . . . As the ratio of nonaudit to audit fee increases, it could become more likely that the nonaudit services provided include one of those that impair independence’ (Chung and Kallapur, 2003, p. 949). Table 2 Panels A and B provide meta-analysis evidence on the association between FRQ and the level of NAF using FEERATIO as the independence proxy based on 38 studies.
Panel A: ES calculation and significance of the studies using FEERATIO as the independence proxy | ||||||||
---|---|---|---|---|---|---|---|---|
FRQ | Author | Stat. value | N | ES | ESz | W (df) | W*ES | W*ES2 |
EM | Frankel et al. (2002) | 0.01 | 2,472 | 0.05 | 0.05 | 2,469 | 123.45 | 6.17 |
Ashbaugh et al. (2003) | 0.02 | 3,069 | 0.04 | 0.04 | 3,066 | 122.64 | 4.91 | |
Chung and Kallapur (2003) | 1.52 | 1,853 | 0.04 | 0.04 | 1,850 | 74.00 | 2.96 | |
Larcker and Richardson (2004) | 1.69 | 5,103 | 0.02 | 0.02 | 5,100 | 102.00 | 2.04 | |
Reynolds et al. (2004) | 0.233 | 2,507 | −0.02 | −0.02 | 2,504 | −50.08 | 1.00 | |
Ferguson et al. (2004) | 0.001 | 610 | 0.13 | 0.13 | 607 | 78.91 | 10.25 | |
Antle et al.(2006) | 2.47 | 2,294 | 0.05 | 0.05 | 2,291 | 114.55 | 5.73 | |
Dee et al. (2006) | 0.60 | 384 | −0.027 | −0.03 | 381 | −11.43 | 0.28 | |
Habib and Islam (2007) | 0.47 | 530 | −0.03 | −0.03 | 527 | −15.81 | 0.47 | |
Mitra (2007) | 1.20 | 71 | 0.14 | 0.14 | 68 | 9.52 | 1.33 | |
Srinidhi and Gul (2007) | 0.01 | 4,282 | 0.04 | 0.04 | 4,279 | 171.16 | 6.85 | |
Lim and Tan (2008) | 1.40 | 4,943 | 0.02 | 0.02 | 4,940 | 98.80 | 1.98 | |
Audit opinion | Wines (1994) | −1.83 | 76 | 0.21 | 0.21 | 73 | 15.33 | 3.21 |
Craswell (1997) | 1.01 | 1,145 | −0.03 | −0.03 | 1,142 | −34.26 | 1.03 | |
Lennox (1999) | 2.05 | 2,244 | −0.05 | −0.05 | 2,241 | −112.05 | 5.60 | |
Sharma and Shidhu (2001) | 0.046 | 49 | 0.29 | 0.3 | 46 | 13.80 | 3.87 | |
DeFond et al. (2002) | 0.49 | 1,158 | 0.02 | 0.02 | 1,155 | 23.10 | 0.46 | |
Firth (2002) | −1.96 | 1,112 | 0.06 | 0.06 | 1,109 | 66.54 | 3.99 | |
Raghunandan (2003) | 3.92 | 300 | 0.22 | 0.22 | 297 | 65.34 | 14.38 | |
Hay et al. (2006) | 0.52 | 643 | 0.03 | 0.03 | 640 | 19.20 | 0.576 | |
Fargher and Jiang (2008) | 0.003 | 5,113 | 0.04 | 0.04 | 5,110 | 204.40 | 8.176 | |
Lim and Tan (2008) | 1.61 | 1,692 | −0.039 | −0.04 | 1,689 | −67.56 | 2.57 | |
Callaghan et al. (2009) | 0.51 | 92 | −0.07 | −0.07 | 89 | −6.23 | 0.44 | |
Liu et al. (2009) | 3.60 | 194 | 0.25 | 0.26 | 191 | 49.66 | 11.94 | |
Earnings conservatism | Ruddock et al. (2005) | 1.23 | 2,497 | 0.024 | 0.02 | 2,494 | 49.88 | 1.44 |
Zhang and Emanuel (2008) | −0.95 | 352 | −0.05 | −0.053 | 349 | −18.78 | 0.87 | |
Earnings restatement | Agrawal and Chadha (2005) | 1.44 | 318 | 0.08 | 0.08 | 315 | 25.2 | 2.02 |
Li and Lin (2005) | 0.82 | 351 | 0.0439 | 0.044 | 348 | 15.31 | 0.67 | |
Bloomfield and Shackman (2008) | −1.86 | 500 | −0.08 | −0.082 | 497 | −39.86 | 3.18 | |
Capital market valuation | Brandon et al. (2004) | 0.023 | 333 | 0.12 | 0.13 | 330 | 42.9 | 4.75 |
Krishnan et al. (2005) | −3.22 | 2,816 | 0.06 | 0.06 | 2,813 | 168.78 | 10.13 | |
Gul et al. (2006) | −2.46 | 840 | 0.08 | 0.08 | 837 | 66.96 | 5.36 | |
Francis and Ke (2006) | −3.96 | 16,243 | 0.03 | 0.03 | 16,240 | 487.2 | 14.62 | |
Chin et al. (2007) | 2.46 | 254 | 0.15 | 0.15 | 251 | 37.65 | 5.65 | |
Dhaliwal et al. (2008) | 2.11 | 560 | 0.09 | 0.09 | 557 | 50.13 | 4.51 | |
Lim and Tan (2008) | −2.92 | 2,935 | 0.05 | 0.05 | 2,932 | 146.6 | 7.33 | |
Ghosh et al. (2009) | 1.03 | 21,762 | −0.007 | −0.007 | 21,759 | −152.31 | 1.07 | |
Niu (2008) | −0.43 | 814 | 0.02 | 0.02 | 811 | 16.22 | 0.32 | |
92,511 | 92,397 | 1,951 | 162.13 | |||||
Mean ES | 0.021 | |||||||
Standard error (SE) | 0.0033 | |||||||
z-statistic | 6.37*** | |||||||
Lower bound | 0.015 | |||||||
Upper bound | 0.027 | |||||||
Homogeneity test (Q-values) | 118.33*** |
Panel B: Component of the FRQ proxies and ES analysis with FEERATIO as the primary independence proxy | |||||
---|---|---|---|---|---|
Earnings management (n= 12) | Audit opinion/ratification (n= 12) | Earnings conservatism (n= 2) | Earnings restatement (n= 3) | Capital market studies (n= 9) | |
Mean ES | 0.029 | 0.017 | 0.011 | 0.0006 | 0.019 |
SE | 0.0061 | 0.0085 | 0.019 | 0.0294 | 0.0046 |
z-statistic | 4.88*** | 2.02** | 0.583 | 0.019 | 4.01*** |
Lower limit | 0.017 | 0.0005 | −0.03 | −0.057 | 0.0095 |
Upper limit | 0.036 | 0.034 | 0.048 | 0.0581 | 0.028 |
Q-values | 20.16*** | 52.15*** | 1.97 | 5.87** | 37.68*** |
Within group Q | 117.83*** | ||||
[20.16+52.15+1.97+5.87+37.68] | |||||
Between group Q (Panel A) | 118.33*** |
- Note: The sample consists of 38 archival studies and 92,511 observations investigating the effect of NAF (proxied by FEERATIO) on five different FRQ proxies. ES is calculated using the following formula:
- The formula is straightforward to apply when the sample studies reported t-statistics. The majority of the studies canvassed for meta-analysis, however, reported p-values. Reported p-values are converted into t-statistics using p-value tables.
- Standard error is calculated using the following formula:
- Lower and upper bound of the ES are calculated as ES- (1.96*SE) and ES+ (1.96*SE) respectively.
- To test whether the various ESs that are averaged into a mean value, are estimating the same population ES, a Q-statistic, which is distributed as a chi-square with k−1 degrees of freedom where k is the number of ES, is calculated using the following formula (Lipsey and Wilson, 2001, p. 115).
-
Where effect sizei (ES) is the individual ES for i = 1 to k, and
is the weighted mean ES over the k ES, and wi is the individual weight for effect sizei.
- Note: Within group Q is distributed as a chi-square with 33 degrees of freedom (number of individual ES 38 minus 5 FRQ categories) and yields a critical value of 54.78 at 1% probability level. This rejects the hypothesis that the residual variability within each FRQ proxy is homogeneous. Between group Q is the reported Q in Panel A, i.e., 118.33. The difference between these two Q is not statistically significant at the critical level (critical value of Q with 4 degrees of freedom at 5% level is 9.49) and thereby rejects the presence of any between-group effect.
- *** and ** denote statistical significance at 1% and 5% level respectively (two-tailed test).
This measure, however, gives equal weight to small and large NAF whereas the incentives for auditors to compromise independence should be stronger when the size of NAF is larger. Ashbaugh et al. (2003), therefore, argue that FEERATIO does not capture the economic importance of the client when total client fees are immaterial to the audit firms and hence recommend using total fees (TOTFEE) as a more appropriate proxy to measure auditor's dependence. Table 3 Panels A and B provide meta-analysis evidence based on FEERATIO proxy. Other proxies include log of NAF, rank of NAF, and components of NAF, for example, audit-related fee ratio, tax ratio, and other fee ratios.7
Panel A: ES measures using total fees (TOTFEE) as the NAF proxy | ||||||||
---|---|---|---|---|---|---|---|---|
FRQ | Author | Stat. value | N | ES | ESz | W (df) | W*ESz | W*ES2 |
EM | Frankel et al. (2002) | 0.16 | 2,472 | 0.03 | 0.03 | 2,469 | 74.07 | 2.222 |
Ashbaugh et al. (2003) | 0.59 | 3,069 | 0.01 | 0.01 | 3,066 | 30.66 | 0.307 | |
Larcker and Richardson (2004) | −6.21 | 5,103 | −0.09 | −0.09 | 5,100 | −459.00 | 41.310 | |
Reynolds et al. (2004) | 0.35 | 2,507 | 0.02 | 0.02 | 2,504 | 50.08 | 1.002 | |
Ferguson (2004) | 0.04 | 610 | 0.08 | 0.08 | 607 | 50.08 | 3.89 | |
Dee et al. (2006) | 0.02 | 384 | 0.12 | 0.12 | 381 | 45.72 | 5.486 | |
Habib and Islam (2007) | 0.14 | 530 | 0.07 | 0.07 | 527 | 36.89 | 2.582 | |
Mitra (2007) | 0.31 | 71 | 0.13 | 0.13 | 68 | 8.84 | 1.149 | |
Srinidhi and Gul (2007) | 0.05 | 4,282 | 0.03 | 0.03 | 4,217 | 126.51 | 3.80 | |
Gul (2007) | 0.05 | 4,720 | −0.03 | −0.03 | 4,717 | −141.51 | 4.245 | |
Lim and Tan (2008) | 0.65 | 4,943 | 0.0092 | 0.0092 | 4,940 | 45.45 | 0.42 | |
Audit opinion | DeFond et al. (2002) | 0.12 | 1,158 | 0.05 | 0.05 | 1,155 | 57.75 | 2.89 |
Lim and Tan (2008) | 8.31 | 1,692 | −0.20 | −0.20 | 1,689 | −337.80 | 67.56 | |
Callaghan et al. (2009) | 0.83 | 92 | 0.03 | 0.03 | 89 | 2.67 | 0.08 | |
Capital market valuation | Brandon et al. (2004) | 0.001 | 333 | 0.18 | 0.18 | 330 | 60.06 | 10.69 |
Krishnan et al. (2005) | −1.68 | 2,816 | 0.03 | 0.03 | 2,813 | 84.39 | 2.53 | |
Chin et al. (2007) | 0.96 | 254 | 0.06 | 0.06 | 251 | 15.06 | 0.90 | |
Dhaliwal et al. (2008) | 2.45 | 560 | 0.10 | 0.10 | 557 | 55.70 | 5.57 | |
Ghosh et al. (2009) | −2.25 | 21,762 | 0.015 | 0.015 | 21,759 | 326.39 | 14.30 | |
Lim and Tan (2008) | −3.16 | 2,935 | 0.06 | 0.06 | 2,932 | 175.92 | 10.56 | |
Mean ES | 0.0051 | |||||||
SE | 0.004 | |||||||
z-statistic | 1.26 | |||||||
Lower bound | −0.0029 | |||||||
Upper bound | 0.013 | |||||||
Homogeneity test | 170.60*** |
Panel B: Component of FRQ proxies and ES analysis (TOTFEE as the independence proxy) | |||
---|---|---|---|
FRQ | Earnings management (11 studies) | Audit opinion/auditor ratification (3 studies) | Capital market studies (6 studies) |
Mean ES | −0.0046 | −0.09 | 0.025 |
SE | 0.0059 | 0.018 | 0.006 |
z-statistic | −0.78 | −5.12*** | 4.24*** |
Lower limit | −0.016 | −0.13 | 0.013 |
Upper limit | 0.007 | −0.06 | 0.037 |
Q-values | 65.85*** | 44.30*** | 17.17*** |
Within group Q | 127.32*** | ||
Between group Q (from Panel A) | 170.60*** |
- Note: The sample consists of 20 archival studies and 60,293 observations investigating the effect of NAF (proxied by TOTFEE) on five different FRQ proxies. ES is calculated using the following formula:
- The formula is straightforward to apply when the sample studies reported t-statistics. The majority of the studies canvassed for meta-analysis, however, reported p-values. Reported p-values are converted into t-statistics using p-value tables.
- Standard error is calculated using the following formula:
- Lower and upper bound of the ES are calculated as ES- (1.96*SE) and ES+ (1.96*SE) respectively.
- To test whether the various ESs that are averaged into a mean value, are estimating the same population ES, a Q-statistic, which is distributed as a chi-square with k− 1 degrees of freedom where k is the number of ES, is calculated using the following formula (Lipsey and Wilson, 2001, p. 115).
-
Where effect sizei (ES) is the individual ES for i = 1 to k, and
is the weighted mean ES over the k ES, and wi is the individual weight for effect sizei.
- Note: Within group Q is distributed as a chi-square with 17 degrees of freedom (number of individual ES 20 minus 3 FRQ categories) and is significantly higher than the critical Q-value of 33.4 at 1% probability level. This rejects the hypothesis that the residual variability within each FRQ proxy is homogeneous. Between group Q is the reported Q in Panel A, i.e., 170.6. The difference between these two Q is significantly higher than the critical Q-value of 9.21 (2 degrees of freedom at 1% level) and reveals the presence of between-group variability.
- *** denotes statistical significance at 1% level (two-tailed test).
Another difficulty associated with conducting meaningful meta-analysis relates to the heterogeneity of the FRQ proxies used. For example, when considering earnings management, some researchers used discretionary accruals in the primary analysis and the propensity of managers to meet or beat earnings targets (benchmark beating phenomena) in the supplementary analysis as proxies for FRQ (Frankel et al., 2002; Ashbaugh et al., 2003; Dee et al., 2006). This concern, however, is primarily limited to only four studies in the earnings management category that use DACCR and earnings benchmarks as alternative earnings management proxies. I provide a sensitivity analysis to assess the impact of these studies with multiple dependent variables on the overall results in section 5.1.
Finally, the ‘file drawer’ problem is tackled by calculating the fail-safe number, which reflects the number of studies failing to report significant results that would be required to reverse a conclusion about a significant relationship between the dependent and independent variables. The number is calculated as follows (Rosenthal, 1991, p. 261):
- 1
I converted all t-statistics into their respective p-values and then all p-values are converted into z-statistics. Individual z-statistics are then combined using the following formula:
Where N is the number of studies included in the meta-analysis and Z is the converted z-statistics.()
- 2
Fail-safe number is calculated using the following formula proposed by Rosenthal (1991):
()

Notwithstanding the popularity of meta-analysis in aggregating studies to provide statistical evidence, a caveat is in order. All the studies selected for meta-analysis use some form of multivariate regression analysis to investigate the research questions. Lipsey and Wilson (2001, pp. 15–16) note that such multivariate analysis ‘can not generally be represented in an effect size statistic but a study may report the correlation matrix upon which the multiple regression is based . . . Meta-analysts have not yet developed effect size statistics that adequately represent this form [multivariate regression] of research finding.’ Notwithstanding this limitation, researchers have used meta-analysis in a number of academic areas to resolve inconsistent findings and this meta-review is one such example.
4: RESULTS

I coded every study ES as positive if it supported the expectation that NAF are negatively associated with FRQ. For example, studies documenting a positive association between earnings management and NAF imply low FRQ. On the other hand, studies documenting a negative association between NAF and unexpected earnings also imply low FRQ. If I had taken the original signs reported in the study then it would fail to capture the fact that both these groups of studies provide evidence that high NAF reduce FRQ. Therefore, to be consistent with the underlying interpretation, I coded every effect size as positive if it supported the hypothesis that high NAF reduce FRQ, even though the actual coefficient signs reported in the original studies could show otherwise. Reported results in Table 2 Panel A reveal that the overall mean ES is 0.021 with an associated z-statistic of 6.37 which is statistically significant at the 1% level, implying that the level of NAF has a negative effect on FRQ (the signs of these ES are reversed from negative to positive to be consistent with the definition of low FRQ).
Looking at the confidence interval of the ES distribution, it is concluded that there is a 0.95 probability that the true association between NAF and FRQ is between 0.015 and 0.027. Cohen (1977, 1978, cited in Lipsey and Wilson, 2001) reports a general rule of thumb statistic for interpreting ES values with ES≤0.20 representing the small range (analogous values for the correlation ES is 0.10). Pomeroy and Thornton (2008) report an average ES of −0.08 in their meta-analysis of the effect of audit committee independence on FRQ.
Table 2 also reveals that nine of the 12 studies on the association between earnings management and the level of NAF report positive ES supporting the conjecture that earnings management increases with the level of NAF. Eight of the 12 studies on the association between NAF and qualified audit opinion find that auditor propensity to provide clean opinion increases with the level of NAF. Archival research on the effect of NAF on the impairment of independence in appearance provides more robust evidence. Six of the nine studies on the association between NAF level and ERC and cost of debt reveal a detrimental effect of the NAF on these proxy measures. For example, Krishnan et al. (2005) document a negative ERC for firms receiving high NAF, indicating that investors perceive such higher fees as a threat to independence. Similar evidence is provided by Gul et al. (2006) in the Australian market. Two other studies by Brandon et al. (2004) and Dhaliwal et al. (2008) in the bond market setting report that there is a positive association between NAF and cost of raising bond capital.
A very limited number of studies investigate the effect of NAF on two other FRQ proxies, namely, earnings conservatism using Basu (1997)-type conservatism measure (two studies), and earnings restatement (four studies). Neither Ruddock et al. (2005) in Australia nor Zhang and Emanuel (2008) in New Zealand find any evidence that earnings conservatism is negatively associated with the level of NAF. The association between NAF and earnings restatement, too, has produced very mixed evidence. Agrawal and Chadha (2005) and Li and Lin (2005) report a positive, but statistically insignificant, association between NAF and the probability of earnings restatement. The positive coefficient is consistent with the argument that higher NAF increases the economic ties between the auditor and the clients, which in turn makes the auditors less willing to challenge questionable accounting practices. Some of these accounting practices may require restatements in the future. Kinney et al. (2004) and Bloomfield and Shackman (2008), on the other hand, show that earnings restatement is negatively associated with the level of NAF.
To ensure that these results are not driven by publication bias, I calculate the fail-safe number for these 38 studies and compare this with the critical fail-safe number. The calculated fail-safe number Nfs= 4,608 for the 38 studies included in Table 2 far exceeds the critical value of 200 calculated using equation (5). Publication bias, therefore, can be ruled out as a concern for the reported results. For individual categories too, this conclusion holds.
4.1: Test of Heterogeneity Among FRQ Proxies

Where ES is the individual effect size for i = 1 to k, and is the weighted mean ES over the k ES, and wi is the individual weight for effect sizei. The reported Q-statistic in Table 2 is 118.33, which is highly significant (the critical value is 73.4 at the p= 0.001 level, with 37 df and strongly rejects the homogeneity hypothesis, suggesting that that the FRQ proxies used in the extant literature do not measure the same underlying construct).
To further explore the source of such heterogeneity, I conduct an analysis of between-study variability as suggested by Pomeroy and Thornton (2008, p. 316):
- •
Group the financial reporting proxies into common categories (e.g., earnings management studies);
- •
Calculate an overall ES for each FRQ group following the ES calculation procedure outlined above;
- •
Calculate a Q-statistic for each group and test the homogeneity of variances assumption for each group;
- •
Sum the Q-values across all individual groups to obtain a within group Q and test for homogeneity;
- •
Finally, compare the within-group Q with the between group Q and test whether the difference is statistically significant.
Panel B of Table 2 presents the between-study variability result for three groups, and allows for testing the hypotheses developed in section 2. The mean ES of the studies investigating the association between earnings management and NAF is 0.03, with an associated z-statistic of 4.90 which is significant at the 1% level and therefore supports H1. Looking at the confidence interval of the ES distribution, it is concluded that there is a 0.95 probability that the true association between earnings management and NAF is between 0.017 and 0.04. The auditors' going-concern opinion-based FRQ proxy provides an overall ES of 0.017, with a z-statistic of 2.02 which is statistically significant at better than the 5% level, and thus supports H2. The true association between NAF and this FRQ proxy is 0.005 to 0.034. The overall ES of the group of studies investigating the association of NAF and market pricing of earnings is 0.019 with a highly significant z-statistic of 4.00. The true association is between 0.0095 and 0.028. H5 is therefore strongly supported. Two studies investigate the bond market effect of higher NAF and reach the conclusion that bond investors also consider high NAF to be a threat to independence. However, they report coefficients which are opposite in sign. Dhaliwal et al. (2008) measure the costs of debt using yield on the first bond issue where the higher the yield the higher the cost of debt. On the other hand, Brandon et al. (2004) use Moody's bond rating as a proxy for cost of debt, where a higher score represents a stronger bond rating. What this suggests is not only heterogeneity among FRQ proxies but also intra-group-heterogeneity.
Panel B also reports the Q-statistics for each component of FRQ proxies. Test of homogeneity of variances for the five categories of FRQ reveals that earnings management, qualified audit opinion and capital market-based studies suffer from heterogeneity of variances (reported Q-stat is 20.16, 52.15, and 37.38, respectively, all significantly larger than the critical chi-square value at better than the 5% level). This implies that empirical research on the association between NAF and FRQ proxies gives rise to significant intra-group heterogeneity with respect to the actual findings.
4.2: Alternative Fee Construct to Measure Auditor Independence
The Frankel et al. (2002) study uses FEERATIO as the primary independent variable capturing the possibility of financial dependence by the incumbent auditor on the client. However, if such financial dependence is the source of concern then the appropriate fee measure should be ‘total fees’ paid to the auditor rather than the non-audit component of total fees. DeFond and Francis (2005, p. 14) argue that while Arthur Andersen pocketed a large sum of NAF ($26 million) ‘there is no evidence whatsoever that non-audit fees and services were a source of Andersen's problems in the audit of Enron’. Table 3 Panel A reports the ES results from studies that use TOTFEE as the fee construct (20 studies are included for this analysis). Reported result reveals that the mean ES is 0.0051 with a z-statistic of 1.26 which is not significant at the conventional significance level.
The test of homogeneity reported in Panel B reveals that the ES for earnings management and audit qualification studies are negative but that of capital-based studies is positive and statistically significant at better than the 1% level. This finding is very important because it reveals that the market perceives NAS as a threat to independence in appearance but not to independence in fact. This finding, however, needs to be evaluated in light of the fact that the three FRQ proxies suffer from intra-group heterogeneity.
Tax services are unique among NAS because, (a) detailed tax laws must be consistently applied; (b) the tax authorities have discretion to audit any tax return; (c) tax services may have a direct and immediate impact on client income and cash flows through tax rate reduction; (d) tax services may also affect public policy in terms of their potential impact on tax equity (Omer et al., 2006); and (e) accounting firms have historically provided a broad range of tax services to their audit clients (SEC, 2003). The SEC (2003) refers to the SOX-2002 act for the guidance for the provision of tax services, ‘any non-audit service, including tax services, that is not described as a prohibited service, can be provided by the auditor without impairing the auditor's independence “only if” the service has been pre-approved by the issuer's audit committee’.8 The SEC favours its longstanding position that provision of tax services by the auditors to their clients does not impair independence. Accordingly, accountants may continue to provide tax services such as tax compliance, tax planning and tax advice to audit clients, subject to the normal audit committee pre-approval requirements.
The PCAOB, however, took a more restrictive stance in Ethics and Independence Rules Concerning Independence, Tax Services and Contingent Fees (PCAOB Release 2005-014, 26 July 2005). The PCAOB was concerned over two types of tax services: auditors' promotion of potentially abusive tax-shelter products to their public company audit clients and their promotion of these and other tax services to public company executives. After a roundtable session the Board concluded that a broad ban on tax services was not necessary to safeguard independence but proposed rules that restrict (a) auditors' involvement in certain aggressive tax-planning transactions, (b) tax services provided to a company's senior financial management, and (c) new procedures for audit committee preapproval. Empirical research on the effect of tax services on FRQ provides mixed evidence. Robinson (2008) and Huang et al. (2007), find no association between auditor provided tax services and the FRQ. But Mishra et al. (2005) reveal that shareholders perceive tax services as a threat against auditor independence. Four of the 45 studies initially selected for the meta-analysis use taxratio (TAXRATIO) as the proxy variable to measure auditor independence. Untabulated result reveals that the mean ES of these studies is −0.013 with a z-statistic of −1.18. This suggests that auditor-provided tax services do not seem to impair independence in fact, consistent with SEC's longstanding position of favouring auditor-provided tax services. However, a very small number of sample studies available for this analysis require a cautious interpretation of the finding.
4.3: Sensitivity Analysis
Multiple dependent variables Some of the earnings management and NAF studies used both DACCR and meeting some earnings benchmarks (analyst forecasts, report small increase in earnings) as earnings management proxies. Reported results in Table 2 include only DACCR-based results. The standard solutions for overcoming multiple dependent variable problem may be summarized as follows (DeCoster, 2004):
- 1
ES can be calculated for each construct and be entered into the same model. This is the easiest approach but violates the independence assumption.
- 2
ES can be calculated for each construct and perform a separate analysis on each. For example in this meta-analysis we can calculate ES for DACCR and benchmark beating separately. This, however, is only feasible if the each response measure is used in a number of different studies.
- 3
Mathematically combine the two ES into one. This is the most preferred method.

Where, ESi is the ES for the ith measure, ρ is the typical intercorrelation between the response measures, and m is the number of alternative constructs being combined. One problem with using Rosenthal and Rubin's (1986) equations is that many studies do not report intercorrelation between the alternative measures. This is particularly acute in the present case as none of the earnings management studies that uses benchmark beating as an alternative earnings management construct provides the required inter-correlation to conduct this analysis.
The next preferred course of action is (2) above which requires a separate estimation of the ESs associated with benchmark beating constructs. I included four studies that use meeting analyst forecasts as an earnings benchmark for this analysis. Research has found that investors pay a premium for firms that meet and/or beat analyst forecasts (Kasznik and McNichols, 2002; Mikhail et al., 2004; Brown and Caylor, 2005). Following this procedure I find the mean ES for this very small sub-group of studies to be 0.022 with a z-statistic of 2.56 which is statistically significant at better than the 5% level. This evidence suggests that managerial propensity to manage earnings is positively associated with the level of NAF when alternative earnings management proxy is used.9
Studies conducted using data from the U.S.A. versus other countries Of the 38 studies selected for meta-analysis in the primary analysis (Table 2), 23 studies use data from the U.S. market while the remainder are from Australia, New Zealand, the United Kingdom, Canada and Bangladesh. Whether the results derived from the U.S. market is generalizable to other countries requires empirical testing because of country-specific differences in reporting incentives attributed to a number of factors including differences in legal system (Habib, 2007). It should be recognized, however, that all the countries studied share common law legal origins and therefore are expected to share a lot of similarities with respect to reporting environment. But a major difference in the reporting environment between the U.S.A. and the rest of the countries is that U.S.A. is characterized by a relatively more litigious environment compared to the rest of the sample countries which may constrain managerial propensity to compromise independence and compel managers to provide more credible financial information. Khurana and Raman (2004) report that Big 4 audit is associated with a lower cost of equity (a proxy for audit quality) in the U.S.A. but not in Canada, Australia or the U.K.
I compute mean overall ES for these two groups of countries using FEERATIO as the independence proxy variable and report the results in Table 4 Panel A. Results reveal that the mean ES is positive and statistically significant for both country groups implying that overall FRQ is negatively associated with the level of NAF. Individual FRQ category analysis shows that earnings management impairs independence in fact both inside and outside the U.S.A. Capital market-based studies, too, provide evidence that the level of NAF is perceived as a threat to independence, again both inside and outside the U.S.A. The association between NAF and the propensity to issue qualified opinions is found to be insignificant for both country categories. Taken together, the evidence indicates that capital market participants consider the provision of NAS as a threat against auditor independence and this may provide some justification for the regulatory ban on NAS.
Panel A: The association between NAF and FRQ using study results from two broad country categories [FEERATIO] as the primary independent variable | ||||||||
---|---|---|---|---|---|---|---|---|
Number of studies | All categories | Earnings management | Audit opinion | Earnings conservatism | Earnings restatement | Capital market studies | ||
U.S.A. | 23 | Mean ES | 0.02 | 0.026 | 0.019 | None | 0.0006 | 0.017 |
z-stat | 5.33*** | 4.08*** | 1.10 | – | 0.02 | 3.54*** | ||
Q-values | 77.16*** | 10.90*** | 28.57*** | – | 5.87*** | 30.02*** | ||
Outside U.S.A. | 15 | Mean ES | 0.027 | 0.05 | 0.011 | 0.011 | – | 0.04 |
z-stat | 3.69*** | 3.03*** | 1.32 | 0.58 | – | 2.68*** | ||
Q-values | 42.93*** | 7.25*** | 24.32*** | 1.97 | 9.06*** |
Panel B: The association between NAF and FRQ using study results from three journal categories | ||||||||
---|---|---|---|---|---|---|---|---|
Number of studies | All categories | Earnings management | Audit opinion | Earnings conservatism | Earnings restatement | Capital market studies | ||
TOP 5 | 11 | Mean ES | 0.03 | 0.034 | −0.016 | – | – | – |
z-statistic | 5.28*** | 5.17** | 0.87 | – | – | – | ||
Q-values | 19.20*** | 8.52*** | 2.34 | – | – | – | ||
Others | 20 | Mean ES | 0.01 | 0.03 | −0.0074 | – | 0.0006 | 0.013 |
z-statistic | 2.46** | 1.69* | −0.88 | – | 0.02 | 2.65*** | ||
Q-values | 58.30*** | 4.94 | 17.47*** | – | 5.87* | 13.17*** | ||
AJPAT | 7 | Mean ES | 0.023 | – | 0.06 | – | – | 0.07 |
z-statistic | 3.17*** | – | 4.15*** | – | – | 3.78*** | ||
Q-values | 94.80*** | – | 17.68*** | – | – | 0.62 |
- Notes: The top five accounting journals are The Accounting Review (TAR), Journal of Accounting Research (JAR), Journal of Accounting and Economics (JAE), Accounting, Organizations and Society (AOS) and Contemporary Accounting Research (CAR). None of the papers selected for the present meta-analysis appeared in the JAE and AOS.
- ES is calculated using the following formula:
- The formula is straightforward to apply when the sample studies reported t-statistics. The majority of the studies canvassed for meta-analysis, however, reported p-values. Reported p-values are converted into t-statistics using p-value tables.
- Standard error is calculated using the following formula:
- Lower and upper bound of the ES are calculated as ES− (1.96*SE) and ES+ (1.96*SE) respectively.
- To test whether the various ESs that are averaged into a mean value are estimating the same population ES, a Q-statistic, which is distributed as a chi-square with k− 1 degrees of freedom where k is the number of ES, is calculated using the following formula (Lipsey and Wilson, 2001, p. 115).
-
Where effect sizei (ES) is the individual ES for i = 1 to k, and
is the weighted mean ES over the k ES, and wi is the individual weight for effect sizei.
- ***,** and *** denote statistical significance at 1%, 5% and 10% level, respectively (two-tailed test).
Journal quality and the association between NAF and FRQ Whether empirical evidence published in top quality journals differs from other journals is also a relevant issue. Hay et al. (2006, p. 157) argue that top quality journals ‘publish higher-quality studies with more robust findings; but there is also a greater possibility of bias, as it is possible that editors are more likely to reject findings that are not interesting because they are not significant’. Table 4 Panel B, therefore, presents a sensitivity analysis comparing studies published in the top five accounting versus other journals (accounting as well as other non-accounting academic journals) to determine whether the results of the research questions vary between these two groups of journals. Eleven of the 38 studies included in this meta-analysis appear in the top three accounting journals (none of the studies appears in Journal of Accounting and Economics and Accounting, Organizations and Society which are ranked among the top five accounting journals). The other 20 studies appear in other accounting and non-accounting journals. I have also grouped seven studies that appear in the specialized auditing journal Auditing: A Journal of Practice and Theory (AJPAT). Evidence reveals that for all these there sub-groups, it appears that increased NAF reduces FRQ, as is evident from a positive and statistically significant overall ES. For individual FRQ proxies, it appears that earnings management increases with the level of NAF in studies published in the top five journals (ES 0.026, z-statistic 4.53, significant at the 1% level). For other journal category, too, this findings hold although the ES is marginally significant. The association between NAF and audit opinion is only significant for studies published in the AJPAT (ES 0.06, z-statistic 4.15, significant at the 1% level). Also for this group of studies, there is strong evidence that ERC decreases with the increase in NAF (ES 0.07, z-statistic 3.78, statistically significant at the 1% level).
5: IMPLICATIONS
5.1: Implications for the Regulators
How does this meta-analysis inform regulators regarding the desirability of restricting auditor-provided NAS to the clients? Regulators, particularly in the U.S.A., seem convinced that the provision of certain kinds of NAS impairs auditor independence, and therefore the SEC restricts those specific types of NASs. The finding from this meta-analysis supports such a regulatory action by documenting the detrimental effect of NAF on FRQ. Although the ES of 0.021 is small, it is statistically significant and this small ES should not be surprising. If it were large, then that would suggest that NAS are the major contributor to good or bad FRQ. However, many other variables besides NAS purchase will influence the quality of financial statements.10 What is interesting is that, although the evidence on the independence—proxied by earnings management and auditors' propensities to issue qualified audit opinion—is mixed, there is robust evidence that the stock market reacts negatively to earnings for firms where the incumbent auditor receives a high NAF. This meta-analysis finding, however, needs to be carefully interpreted in light of the fact the underlying studies being examined are not homogeneous and therefore, the results may not be reliable for policy making purposes. Additionally, regulators are more concerned with preventing a major audit failure, and may, therefore, be less interested in the small incremental results that might show up in studies examined here.
5.2: Implications for Future Research
The earnings quality metrics used in the extant research are all justifiable. However, the reliability of such measures needs to be subjected to additional tests. Future research could examine the association between NAF and balance sheet-based FRQ measures as proposed by Barton and Simko (2002). With the availability of disaggregated NAF data, an exciting avenue for future research would be to link different categories of NAF with FRQ measures based on a careful theorization of the possible impact of such disaggregated NAF.
Heterogeneity in FRQ proxies has been identified as the primary driver behind the poor correlation observed between NAF and FRQ proxies. Future research should investigate whether such heterogeneity is desirable. Hunter and Schmidt (1990) argue that broad constructs like FRQ can and should be operationalized in a number of ways, as long as they are informed by rigorous theoretical underpinnings. Why does this heterogeneity exist with respect to the operationalization of the FRQ construct? Pomeroy and Thornton (2008) identify publication bias and accounting researcher incentives as the two most important drivers of diversity and inconsistency in selecting the dependent variable, FRQ in this case. The authors state that, ‘even when we include all known studies of AC [audit committee] effectiveness to date, both published and unpublished, we cannot reliably aggregate across all of them because of significant inconsistencies in defining the construct “financial reporting quality” and a relative absence of studies designed to replicate, and thereby enhance confidence in, the results of previous studies’ (emphasis added, p. 319). However, replication bias is less pronounced in NAF and FRQ literature. An example is a number of replication studies (at least three) documenting no association between NAF and FRQ in response to the original Frankel et al. (2002) study which documented that auditors receiving higher NAF allow managers to engage in earnings manipulation practices. Another example of the absence of such publication bias relates to the effect of NAF on auditor reporting decisions. This stream of research generally provides evidence supporting the hypothesis that high NAF compromises auditor independence by making auditors more inclined to issue clean audit reports even when a qualified audit report is justified. Researcher incentives, on the other hand, seem to constrain successful meta-analysis in the NAF and FRQ domains, as is consistent with Pomeroy and Thornton (2008). Successful researchers need to differentiate themselves by proposing and testing novel and innovative measures of FRQ proxies. However, in doing so they run the risk of moving away from the mainstream, even though the reliability of the findings from the mainstream could have been questioned (Bamber et al., 2000).
Meaningful interpretation of the present meta-analysis is further complicated by the fact that there is yet to be any consensus on which fee variable truly captures compromise of audit independence. The popular fee construct has been the FEERATIO, which measures NAF as a percentage of total fees. This ratio, however, fails to consider the economic significance of total fees and has, therefore, been criticized as a valid measure of the economic bond between the auditor and the client. Further research should be undertaken to explore the theoretical rationales for using one fee measure over the other.
Another shortcoming of the studies included for meta-analysis is the piecemeal approach adopted by researchers for addressing the relationship between NAF and FRQ. For example, capital market effect studies investigate the market reaction to unexpected earnings conditional on the level of NAF. The underlying theory is that the market considers such earnings as tainted and penalizes accordingly. However, this assumption will be supported only when the discretionary accruals component of earnings is negatively priced for firms with high NAF. But earnings management research does not provide overwhelming evidence of firms with high NAF engaging in more earnings manipulation. Additionally, managers could use discretionary accruals for both opportunistic as well as efficient contracting purposes (Subramanyam, 1996). Income increasing discretionary accruals allowed by audit firms with high NAF do not necessarily imply dysfunctional behaviour by corporate managers. Market reaction would be a nice setting to test these propositions.
6: CONCLUDING REMARKS
It has been well documented that auditing serves as an effective corporate governance mechanism. However, audit quality has come under serious scrutiny because of the joint provision of audit and NAS, and regulators consider such services to be a threat to auditor independence. A sizable volume of archival research has developed over the years to address whether such a presumption is supported by actual data. The purpose of this paper is to conduct a systematic meta-analysis to provide a statistical assessment of the association between NAF and FRQ. Meta-analysis is superior to narrative literature review because it aggregates results across individual studies statistically and corrects for statistical artefacts like sampling and measurement error, thereby providing much greater precision with respect to the findings compared with narrative reviews. Meta-analysis results provide some evidence that investors perceive NAF as a threat to independence, as is evident from the negative association between the level of NAF and the ERC. However, the lack of homogeneity in the underlying studies remains to be a concern.