Within-Industry Specialization and Global Market Power

Ahmad Lashkaripour

Within-Industry Specialization and Global Market Power

Ahmad Lashkaripour (Indiana University, CESifo, CEPR)

American Economic Journal: Microeconomics · February 2020

Read PDF · Markdown source · Reader view · Replication files

Abstract. Export price levels exhibit tremendous cross-national and spatial variation, even within narrowly-defined industries. Standard theories attribute this variation to within-industry quality specialization. This paper argues that a significant portion of the export price variation is driven by rich and remote economies specializing in high-market power, high-returns-to-scale segments of industries. I also argue that this particular pattern of specialization (i) accounts for 30% of the overall gains from trade, and (ii) explains more than 37% of the observed cross-national income inequality.

Machine-readable full text

This section was extracted with OpenDataLoader PDF from the hosted PDF so the full text is accessible in HTML and Markdown.

American Economic Journal: Microeconomics 2020, 12(1): 75–124 https://doi.org/10.1257/mic.20170336

Within-Industry Specialization and Global Market Power†

By Ahmad Lashkaripour*

Export price levels exhibit tremendous cross-national and spatial variation, even within narrowly defined industries. Standard theories attribute this variation to within-industry quality specialization. This paper argues that a significant portion of the export price variation is driven by rich and remote economies specializing in high-market power segments of industries. I also argue that this particular pattern of specialization (i ) accounts for 30 percent of the overall gains from trade, and (ii ) explains more than 37 percent of the observed cross-national income inequality. (JEL D43, F12, F13, F14, L15, L22, O19)

F

or some time now, there has been a consensus among trade economists that international specialization occurs predominantly within rather than across

industries (Schott 2004). Accordingly, there has been a surge in new theories of within-industry specialization, many of which are motivated by two observations. First, there is tremendous variation in export price levels even within narrowly defined industries. Second, these variations are systematic:1

(i) Export price levels increase with the exporter's income per capita; and
(ii) Export price levels increase with geo-distance to global markets.

The dominant thesis in explaining the above facts is that high-income and geographically distant economies specialize in high-quality varieties within industries. Quality specialization, in these instances, is fueled by either factor-driven comparative cost advantage (Schott 2004), geography-driven comparative cost advantage (Hummels and Skiba 2004), or the home-market effect (Fajgelbaum, Grossman, and Helpman 2011).2 The quality specialization thesis has proven to be quite influential, motivating a vibrant strand of macroeconomic growth theories (Grossman and Helpman 1991), as well as enhancing our understanding of

*Assistant Professor of Economics, Indiana University, 100 S. Woodlawn Ave., Bloomington, IN 47405 (email: alashkar@indiana.edu). John Asker was coeditor for this article. I thank seminar participants at various venues for providing helpful comments and suggestions that improved this paper. Special thanks to Angela G. Campbell, Farid Farrokhi, Cecilia Fieler, James Tybout, and the four anonymous referees. Finally, I owe a debt of gratitude to my former advisors Jonathan Eaton and Stephen Yeaple for their invaluable encouragement and support.

†Go to https://doi.org/10.1257/mic.20170336 to visit the article page for additional materials and author disclosure statement(s) or to comment in the online discussion forum.

1See Schott (2004), Hummels and Skiba (2004), and Hummels and Klenow (2005).
2See also, among others, Sutton (2007); Hallak (2006); Choi, Hummels, and Xiang (2009); Khandelwal

(2010); Baldwin and Harrigan (2011); Crozet, Head, and Mayer (2012); Johnson (2012); Lugovskyy and Skiba

(2015); Feenstra and Romalis (2014); Sutton and Trefler (2016); and Dingel (2016), who emphasize the quality specialization channel.

75

trade-induced inequality (see Verhoogen 2008 and Fajgelbaum, Grossman, and Helpman 2011).

This paper, however, argues that this dominant thesis goes too far in attributing facts (i) and (ii) to exclusively quality specialization. As a matter of arithmetic, price depends on (i) the marginal cost that varies with output quality, and (ii) the markup that varies with the degree of market power. An accumulating body of evidence indicate that the markup component is nontrivial.3 Even more revealing, I present new evidence that aggregate measures of export markup covary with income per capita and geo-distance in a fashion that exactly mirror the variation in export price levels.

Motivated by such evidence, I propose an alternative view of within-industry specialization that emphasizes the market power channel. That is, industries are comprised of multiple segments with differential degrees of market power.4 Firms in each country sort into high- and low-market power segments of industries, taking into account aggregate cost measures such as the wage rate or distance to global markets. Collectively, these firm-level decisions lead to international specialization across low-market power and high-market power segments of industries. This pattern of specialization, in turn, provides an alternative explanation for facts (i) and (ii). More importantly, it delivers a set of macro-level implications that are starkly different from the quality specialization thesis.

I establish my theory using a general framework that nests an extensive class of trade models featuring firm heterogeneity, quality-sorting, and variable markups,

as a special case. Extending the result in Arkolakis et al. (2019), I show that in this general framework, variable markups alone cannot explain the cross-national heterogeneity in export price levels to a given market. However, under a set of testable assumptions, such heterogeneity can arise from within industry specialization across low- and high-markup segments of industries.

The intuition behind my theory is the following. Goods manufactured with high-wage labor or transported over long distance exhibit a cost disadvantage. To circumvent such a disadvantage, high-wage and remote economies specialize in industry segments where aggregate demand is relatively less sensitive to cost and relatively more sensitive to output quality. I label such segments of an industry as " quality-intensive." From the perspective of a standard trade model, "quality-intensive" segments are those that are subject to a lower trade elasticity.

My testable claim is that quality-intensive segments are also those where individual firms enjoy more market power. Hence, international specialization across low- and high-quality intensive segments can be interpreted as specialization across low- and high-markup segments. Under this interpretation, international specialization creates a positive link between an economy's wage rate or distance to global markets and its average export markup levels. This positive link, in turn, provides an alternative markup-based explanation for facts (i) and (ii) presented earlier.

3For instance, according to Berry, Levinsohn, and Pakes (1995), more than 30 percent of the price variation in the auto industry is due to markup differences.
4The view that industries consist of segments with differential degrees of market power has deep roots in the literature. See, for example, the case of specialty versus popular-market beer in Hausman, Leonard, and Zona (1994) or luxury versus economy cars in Berry, Levinsohn, and Pakes (1995) and Goldberg (1995).

Employing rich micro-level trade data, I present support for the main assumption underlying my theory. That is, I estimate that the degree of firm-level market power is significantly greater in high quality-intensive segments of industries. Furthermore, I find that the composition of country-level exports aligns with the prediction that high-wage and distant economies export relatively more in high-market power segments of each industry.

Beyond this evidence, the present theory fits well with the observation that markup levels have diverged between rich and poor economies in the past few decades (De Loecker and Eeckhout 2018; Díez, Leigh, and Tambunlertchai 2018). Relatedly, it can explain why local prices and markups have risen in Europe (De Loecker, Fuss, and Van Biesebroeck 2014) but have fallen in China following trade liberalization (Brandt et al. 2017). From the standpoint of standard variable markup theories, the asymmetric effects of import competition on high- versus low-wage economies are difficult to explain. The present theory, however, predicts that import competition will induce European firms to sort into high-market power segments of industries, and Chinese firms to sort into the low-market power segments. These asymmetric effects will raise the average market power and markup of European firms while reducing them for firms in China.

At the macro level, my view of within-industry specialization produces distinct implications relative to the quality specialization thesis. The reason being that specialization, here, occurs across industry segments that differ in their (i) degree of market power, (ii) degree of returns to scale, and (iii) trade elasticity. Considering that the latter two characteristics regulate aggregate productivity and the gains from trade, within-industry specialization along these margins can profoundly influence macroeconomic outcomes.

To quantify macro-level implications, I estimate my model using industry-level trade and production data from the World Input-Output Database, covering 35 industries and 32 major economies. Using the estimated model, I find that the gains from trade are (on average) 20 percent in terms of real GDP. Of the total gains, 6 percent is driven by within-industry specialization across low- and high-market power segments, while the remaining 14 percent is driven by traditional forces in the Arkolakis, Costinot, and Rodríguez-Clare (2012) framework.

There is, however, tremendous cross-national heterogeneity in the gains from within-industry specialization. Advanced or remote economies like Australia, France, and Germany gain significantly more as they specialize in high-market power segments where the degree of scale economies and returns to specialization are higher. In comparison, the pure gains from specialization are much smaller (and even negative) for developing economies like Mexico and Indonesia, as they specialize in low-returns-to-scale segments where returns to specialization are relatively low.

At a broader level, the present model sheds new light on cross-country income differences that are believed to be puzzlingly large and have attracted considerable attention in the literature.5 From the perspective of the present theory, high-TFP

5See, for example, Klenow and Rodríguez-Clare (1997); Prescott (1998); Hall and Jones (1999); Parente and Prescott (1999); Acemoglu, Johnson, and Robinson (2000); Parente and Prescott (2000); Caselli (2005); Jones

(2011).

economies tend to specialize in high-market power segments where returns to scale and specialization are higher. This pattern of specialization, in turn, multiplies the fundamental TFP differences between rich and poor economies. Based on my estimated model, within-industry specialization acrosshigh- and low-returns-to-scale segments can explain up to 37.3 percent of the cross-national real income inequality.

Aside for the vast literature on quality specialization, this paper is closely related to a growing literature on variable markups and pricing-to-market in international settings. Generally speaking, this literature is either focused on (i) the procompetitive effects of trade (Melitz and Ottaviano 2008; Spearot 2013; Holmes, Hsu, and Lee 2014; de Blas and Russ 2015; Edmond, Midrigan, and Xu 2015; De Loecker et al. 216), or (ii) export price discrimination across markets (e.g., Hummels and Lugovskyy (2009), Alessandria and Kaboski 2011, Simonovska 2015, Bertoletti and Etro 2017). In comparison, this paper concerns (i) the heterogeneity in export price levels within markets as well as (ii) the asymmetric effects of trade liberalization on markups in low- versus high-income economies.

My theory also shares common elements with Fieler (2011).6 In both models, rich economies specialize in low trade elasticity product categories. The present theory, however, exhibits two key differences. First, it identifies a new source of heterogeneity in the trade elasticities, whereby differences in the segment-level trade elasticities are driven by differences in the degree of quality-intensity. Second, my theory establishes a systematic link between patterns of specialization, degree of global market power, and export price levels, which are absent in Fieler (2011). That being the case, my model delivers strictly different macro-level predictions relative to the perfectly competitive model in Fieler (2011).

Finally, the present paper is related to a growing literature concerning aggregation bias in the estimated gains from trade (e.g., Costinot and Rodríguez-Clare 2014, Levchenko and Zhang 2014, Ossa 2015, Brooks and Pujolas 2016). My paper contributes to this literature by identifying a systematic source of aggregation bias that is triggered by specialization across low- and high-returns-to-scale segments of industries.

###### I. Suggestive Evidence

As noted in the introduction, many theories of within-industry specialization are motivated by the fact that export price levels increase systematically with the exporting economy's (i) income per capita and (ii) geo-distance to global markets. Moreover, the thesis underlying these theories is that these two facts are driven by quality specialization within industries—see Schott (2004) and Hummels and Skiba (2004), among others.

In this section, I present suggestive evidence that facts (i) and (ii) can be partially attributed tocross-national differences in export markup levels. That is, they arise due to high-income and remote exporters either charging higher markups for the same good or specializing in the production of high-markup varieties within industries.

6Relatedly, see Kraay and Ventura (2007); Coibion, Einav, and Hallak (2007); McCalman and Spearot (2013); and Holmes and Stevens (2014) for other papers that emphasize specialization across more- and less-differentiated goods.

As a useful benchmark, I first present the evidence corresponding to facts (i) and (ii). I then present additional evidence that highlights the importance of the previously overlooked markup channel. My analysis here employs two different datasets. To analyze export price levels, I use the the Comtrade-baci database. To study aggregate markup levels, I use the the Cepii TradeProd database (De Sousa, Mayer, and Zignago 2012). To make the two datasets compatible, I restrict my analysis to 149 economies and 27 three-digit ISIC industries that are represented in both datasets. Even after making the two datasets compatible, the Comtrade-baci database remains considerably more granular, as it reports bilateral trade statistics

at the 6-digit Harmonized System (HS) level of aggregation. By comparison, the TradeProd database only reports production statistics at the 3-digit ISIC level of aggregation—a more comprehensive description of both datasets in provided in Appendix A.7

The effect of income per capita and geographical remoteness on export price levels can be estimated using the following equation:

(1) lnp– jk t = β 1 ln ̅ Dist j + β 2 lnGDPcap j t + β 3 lnGDP j t + δ k t + ε j k t ,

where p– j k t denotes the ( quantity-weighted) unit price of country j 's exports in industry k in year t . On the right-hand side, ̅ Dist j is country j 's population-weighted distance to global markets; GDPcap j t and GDP j t control for country j 's GDP per capita and market size in year t ; and, finally, δ k t controls for industry-year fixed effects.

Since the BACI data reports trade statistics at HS6 level of aggregation, I conduct the above estimation independently with both HS6-year and ISIC-year fixed effects. Estimation results are reported in Table 1, reestablishing the widely held belief that export price levels increase systematically with the exporting economies GDP per capita and remoteness.

Noting that export price levels, P = m  (φ) , are composed of a markup component, m , and a marginal cost component,  (φ) , there are two ways to interpret the cross-national export price variation. The standard "quality specialization" thesis attributes the variation to cross-national quality differences. That is, the patterns documented in Table 1 are attributed to high-income and remote economies specializing in high- φ ( high-quality) varieties within industries.8

Below, I present evidence in favor of the alternative view that the observed export price variation is driven partly by markup differences. To present my claim, I first calculate aggregate markup levels by closely following the methodology in De Loecker and Eeckhout (2018). That is, I assume a conventional production function for country j in industry k in year t . Namely, Q j k t = T j k t V j α k t

K β j k t

, where Q jkt denotes the units of output produced, V j k t denotes the variable input (e.g., labor, intermediate inputs, etc.), K j k t denotes capital, and T j k t denotes Total Factor Productivity (TFP). As in De Loecker and Eeckhout (2018), this production

k t

7The TradeProd database has a more comprehensive longitudinal coverage. It spans 26 years from 1980 to

The Comtrade-baci database, by comparison, covers the years 2003 to 2015.

8To be more specific, existing theories attribute the effect of GDP per capita to factor-driven or demand-driven

quality specialization, and attribute the effect of remoteness to the "Washington Apples" effect or quality-sorting.

Table 1—Cross-National Variation in Export Price and Markup Levels Dependent (log)

Unit price Markup Remoteness (log) 0.145 0.242 0.290

(0.002) (0.029) (0.016) GDP per capita (log) 0.141 0.162 0.026

(0.002) (0.014) (0.003) GDP (log) − 0.058 − 0.062 0.001

(0.001) (0.008) (0.003)

Observations 4,345,390 50,851 Fixed effects HS6 × year ISIC × year ISIC × year R 2 0.64 0.37 0.53

Notes: The estimating equations are (1) and (2). In the markup regression, each observation is weighted by the number of represented firms, which is proportional to the number of workers based on the forthcoming model. Robust standard errors are reported in parentheses.

structure implicitly assumes that countries vary in their TFP and the chosen inputs, but not in the technology Q .

The Lagrangian of the cost minimization problem is thus given by  j V k t V j k t +  K j k t K j k t − ν j k t ( T j k t V j α k t

− Q– j k t ) , where  j V k t and  K j k t denote input prices, ν j k t is the Lagrangian multiplier, and Q– j k t is a constant. The first-order condition of this problem with respect to V j k t implies that ν j k t =  V j k t V j k t / α k t Q jk t . Given that the shadow price of the Lagrangian is, by construction, equal to the marginal cost, the markup m ji,k =  j k t / ν j k t can, therefore, be expressed as

K j β k t

k t

 _ j k t Q j k t  j V k t V j k t

m j k t = α k t

.

Given the above expression, I can calculate m j k t up to an industry × year-specific output elasticity, using data on total output,  j k t Q j k t , and variable input cost,  V j k t V j k t , from the Cepii TradeProd database. Doing so, I can run the following regression that is analogous to (1), but includes m as the dependent variable:

(2) lnm j k t = β 1 ln ̅ Dist j + β 2 lnGDPcap j t + β 3 lnGDP j t + δ k t + ε j k t .

As before, δ k t , in the above equation, controls for industry-year fixed effects and absorbs all the variation in the output elasticity, α k t .9

The last column in Table 1 reports the estimation results corresponding to the markup regression. Evidently, the output sold by high-income and remote economies exhibits a systematically higher-markup level. Moreover, the elasticity at which markups increase with GDP per capita and remoteness closely resembles the corresponding elasticity for export prices. The higher number of observations in the

9It should be reemphasized that the assumption that α k t is industry × year-specific has precedent in the literature. De Loecker and Eeckhout (2018) and Díez, Leigh, and Tambunlertchai (2018), for instance, adopt a similar assumption when analyzing global market power using firm-level data.

first two columns is simply reflective of the Comtrade-baci data reporting export price levels at a much finer level of aggregation.

While revealing, the evidence presented in Table 1 is subject to an important caveat. I use country-level production data to identify the aggregate markup level in each economy. So, if firm-level markup distribution is highly dispersed within each economy, the results presented in Table 1 may be be subject to aggregation bias; the extent of which depends on how the firm-level markup dispersion covaries with GDP per capita and remoteness.10

II. A Model of Within-Industry Specialization

Consider a world economy consisting of i = 1,... , countries. Country i is

populated by L i individuals, each endowed with one unit of labor, which is the sole factor of production. All individuals are perfectly mobile across the production

of different goods but are immobile across countries; and are paid a wage, w i , in

country i . There are  industries indexed by k , each comprised of multiple segments that differ in characteristics such as degree of market power and quality intensity.

Segments are indexed by z , with  k denoting the set of all segments pertaining to industry k —for example, k can be the auto industry, which is composed of luxury, economy, and cross-over vehicle segments.

Consumers and Preferences.—All individuals have similar preferences that are characterized by a general utility function, U (  1 ,... ,  ) , where  k is the subutility corresponding to industry k . The industry-level subutility is a function of the composite consumption bundle, which combines goods from various segments in that industry. In particular,

 k =  k ( C 1 ,... ,C  ) ; 1,..., ∈  k

where the segment-level subutility, C z , is a function of the quantity q ≡ [ q ω ] ω∈Ω z and quality φ ≡ [ φω ] ω∈Ω z of firm-level varieties (indexed by ω ) available to the consumer in segment z. Contrary to the dominant approach in the literature, I impose only weak isomorphism between quality and quantity, which is outlined by the following assumption.

ASSUMPTION A1: Quality and quantity are only weakly isomorphic. In

particular, Cz is a function of the quantity and quality of the goods in one's consumption basket:

C z ( φ ρ z q 1−ρ z ) .

Correspondingly, ρ z ∈ [0,1] denotes the relative importance of product quality in segment z , which I hereafter refer to as the degree of quality intensity.11

10Importantly, running regression (2) using the markups estimated by De Loecker and Eeckhout (2018) with

firm-level data for a sample of 40 countries produces qualitatively similar results.

11In the CES case, Assumption A1 can be stated as C z ( φ ρ z q 1−ρ z ) = [ ∫ ω∈Ω z ( φ ω ρ z q ω 1−ρ z ) η z dω] 1/η z (1−ρ z ) .

Note that Assunption A1 imposes weaker conditions on preferences than conventional theories, which typically assume exact isomorphism between quality and quantity (see Melitz and Redding 2014).12 To elaborate, exact isomorphism corresponds to a special case of Assumption A1, where ρ z is uniform across all segments. In such a case, there is no real distinction between quality and quantity, and productivity differences across producers can be interpreted as quality differences and vice versa.13 There is, however, exhaustive evidence from consumer surveys that individuals care relatively more about quality (and relatively less about price) when purchasing certain product categories (see Fornell et al. 1996). By relaxing the uniformity restriction on ρ z , Assumption A1 accommodates this feature of the data. Accordingly, I will hereafter refer to high- ρ segments as "high quality-intensive" segments of the industry.

According to Assumption A1, consumers maximize their utility along both the quality and quantity dimensions. To streamline the presentation, however, I can convert the consumer problem into a one-dimensional one that is expressed in terms of effective (or quality-adjusted) quantities and prices. To this end, I use the tilde notation to denote effective values, with q̃ = ( φ 1−ρ z q ρ z ) 1/ρ z and p̃ ≡ ( p ρ z / φ 1−ρ z ) 1/ρ z , respectively, denoting effective quantity and price. Considering this choice of notation, I assume that U i ( · ) and  i,k ( · ) are additively separable and C z ( · ) exhibits a parameterization that delivers the following class of demand functions.

ASSUMPTION A2 (Generalized Gorman-Pollak Demand): For a consumer with

income y, facing a schedule of effective prices p ̃ ≡ [ p̃ ω ] ω∈Ω , the Marshallian demand for variety ω ∈ Ω assumes the following formulation:

q̃ ω =  z ( F z ( Λ z ) p̃ ω / y)  z ( Λ z )

where  z ( · ) is a strictly decreasing function; F z ( · ) and  z ( · ) are differentiable functions; and Λ z ≡ Λ z ( p ̃ ,y) is a scalar demand shifter that firms take as given and that implicitly solves the budget constraint.14

The above demand system nests (i) the directly separable preferences in

Arkolakis et al. (2019) as a case where  z ( Λ z ) is constant and F z ( Λ z ) = Λ z , (ii) the indirectly additive preferences in Bertoletti, Etro, and Simonovska (2018) as a case

where F z ( Λ z ) is constant and  z ( Λ z ) = Λ z , and (iii) the homothetic single aggregator in Matsuyama and Ushchev (2017) as a case where F z ( Λ z ) =  z ( Λ z ) = Λ z . Another well-known special case of Assumption A2 is the CES demand system, which arises when F z ( Λ z ) is constant,  z ( Λ z ) = Λ z , and  z (x) = x −ε z .15

12Hallak and Sivadasan (2013) and Sutton (2007) have challenged the exact quality-quantity isomorphism from a theoretical perspective, while Bils and Klenow (2001); Baldwin and Ito (2008); Fan, Li, and Yeaple (2015); and Flach (2017) have challenged it based on empirical evidence.
13As we will see shortly, the multi-industry Melitz-Pareto model in Chaney (2008) corresponds to a special case of the present model where ρ z is uniform and equal to 1/ 2 in all industries.
14Specifically, Λ z is an implicit solution to ∫ ω∈Ω z p ω  z ( F z ( Λ z ) p̃ ω / y)  z ( Λ z ) dω = e z α k y , where α k and e z ,

respectively, denote the share of expenditure on industry k and segment z ∈  k .

15See Fally (2019, 2018) for a more thorough discussion regarding the generalized Gorman-Pollak demand

system.

It is important to note that the theoretical propositions that follow apply to several demand systems that are not readily nested by Assumption A2. For instance, the theory presented here applies equally to (iv) the quadratic but nonseparable demand system (Ottaviano, Tabuchi, and Thisse 2002; Melitz and Ottaviano 2008); (v) the QMOR expenditure function, as well as (vi) Kimball preferences (e.g., Kimball 1995; Klenow and Willis 2016).

To simplify the notation, and given that the schedule of prices, p ̃ , and income, y , vary across markets, I henceforth use Λ i,z ≡ Λ z ( p ̃ i ,y i ) ;  i,z ≡  z ( Λ i,z ) and F i,z ≡ F z ( Λ i,z ) to denote demand shifters associated with country i 's market in segments z .

Firms and Production.—Every industry k is populated with N j,k firms from

country j , which compete under monopolistic competition. There is free entry, whereby a large pool of ex ante identical firms in country j can pay an entry

cost, w j f k e , to enter the industry. The free entry assumption introduces scale effects that magnify the extent of international specialization, but is not consequential to the main predictions of the model.

Firms operating in segment z of industry k are heterogeneous in their output quality, φ, which is the realization of a random variable drawn independently across firms from a distribution G j,z . As a key assumption, I impose that G j,z is Pareto with the same shape parameter γ k > 0 in all countries and in all segments pertaining to industry k .

ASSUMPTION A3: For all φ and all z ∈  k , Gj,z (φ) = 1 −  j,z φ − γ

k , with γ k > 0 .16

The Pareto parametrization specified by Assumption A3 is by far the most common distributional assumption in heterogeneous firm models (e.g., Chaney 2008, Melitz and Ottaviano 2008, Simonovska 2015, Arkolakis et al. 2019). As we will see shortly, the main advantage of Assumption A3 is delivering the gravity equation for aggregate trade flows. The more specific assumption that γ k is uniform within industries is not consequential for the results that follow. Instead, it will allow me to highlight a previously unknown source of variation in the trade elasticities. Moreover, if ρ z is also uniform across industries (i.e., quality and quantity are exactly isomorphic), the quality heterogeneity specified by Assumption A3 can be trivially reinterpreted as productivity heterogeneity (see Melitz and Redding 2014). In that regard, the present framework also nests an extensive class of models featuring heterogeneous productivity firms.

Considering Assumptions A1–A3, firm ω is uniquely characterized by its output quality, φ. So, to simplify the notation without risk of confusion, I henceforth drop firm indices and use lowercase letters to denote firm-level variables as a function of firm-level quality and aggregate variables.

16The support of distribution G j,z (φ) is φ ≥  1 j,/z γ

.

k

Upon entry, a firm can supply a differentiated good in each segment of industry k .17 The supply of differentiated goods is governed by constant returns to scale technologies, whereby firms from country j face the following linear cost function when serving segment z ∈  k in market i :

 (q;τ ji,k , f i,z ,w j ,w i ) = τ ji,k a z w j q + w i f i,z .

In the above expression q is the quantity supplied, with c z ( τ ji,k ,w j ) ≡ τ ji,k a z w j denoting the marginal cost of production and transportation that is paid in terms of

the domestic labor wage, w j . The marginal cost is governed by a segment-specific cost shifter, a z , and an exporter-importer specific term, τ ji,k , that accounts for international differences in production and transportation costs—as shown in Appendix B, the above cost function and the corresponding analysis can be easily generalized to allow for the dependence of  ( · ) on output quality. Finally, w i f i,k accounts for the fixed cost of serving market i which is paid in terms of labor in the destination market. The cost function can be re-formulated in terms of effective output, q̃ , as follows:

 ̃ ( q̃ ,φ;τ ji,k , f i,z ,w j ,w i ) = τ ji,k a z w j q̃ φ

_ ρ z

ρ z −1 + w i f i,z .

Under monopolistic competition, a firm operating with an effective variable cost, c̃ z (φ;τ ji,k ,w j ) ≡ τ ji,k a z w j φ ρ

z /( ρ z −1) , chooses a market and segment-specific effective price p̃ z ( c̃ z ,P i,z , i,z ) in order to maximize profits:

π ( c̃ z ,P i,z , i,z ,L i ) = max p̃ z ( p̃ z − c̃ z ) q̃ ( p̃ z ,P i,z , i,z ) L i ,

where q̃ ( · ) × L i is the total demand facing the firm in market i . Given that there is zero cross-cost passthrough between different varieties of a given firm, the first-order condition for the above problem entails that firms charge a monopoly markup that is inversely related to the underlying demand elasticity (Laffont and Tirole 1993). Considering the optimal markup, a firm will serve segment z of market i , if and only if π ( c̃ z ,P i,z , i,z ,L i ) ≥ w i f i,z .

A. Trade Equilibrium

Below, I characterize equilibrium outcomes at the firm and at the aggregate levels—in doing so, I borrow heavily from the analysis in Arkolakis et al. (2019).18 Subsequently, I present a formal summary of the trade equilibrium.

17By allowing firms to serve multiple segments, the present model accommodates firm heterogeneity in product scope. According to Hottman, Redding, and Weinstein (2016), product quality and product scope, together, explain more than four fifths of the variation in firm sales.
18While I closely follow Arkolakis et al. (2019) in deriving the gravity equation, the present model exhibits key differences with Arkolakis et al. (2019). Most importantly, aside from considering a more general demand system, Arkolakis et al. (2019) is a special case of the present model where ρ z is uniform across segments and equal to 1/ 2. As a result, the pattern of specialization highlighted in Section IIC is absent in the Arkolakis et al. (2019) framework.

Firm-Level Outcomes.—To outline the firm-level outcomes, consider a firm serving segment z in market i , with effective cost c̃ i,z and effective price p̃ i,z —keep in mind that c̃ i,z ≡ c̃ z (φ;τ ji,k ,w j ) is a function of country of origin characteristics and firm-level quality. Following Assumption A2, the Marshallian demand elasticity facing such a firm can be expressed as a function of its effective price, p̃ i,z , relative to a market-level index, P i,z ≡ w i / F i,z , as follows:

ε z ( p̃ i,z / P i,z ) ≡ − ∂ lnz ( p̃ i,z / P i,z ) / ∂ lnp̃ i,z .

Hence, defining ν i,z ≡ P i,z / c̃ i,z , the optimal markup, m i,z ≡ p̃ i,z / c̃ i,z = p i,z / c i,z on any variety sold in segment z can be calculated as

m i,z =

_____________ ε z ( p̃ i,z / P i,z ) ε z ( p̃ i,z / P i,z ) − 1

=

______________ ε z ( m i,z / ν i,z ) ε z ( m i,z / ν i,z ) − 1

.

It is straightforward to verify that if εz′ (x) > 0, there exists a unique markup function, m i,z ≡ μ i,z ( ν i,z ) , that implicitly solves the above equation. More importantly, μ i,z ( · ) does not depend on the firm's country of origin. Altogether, given the demand function specified by Assumption A2, and noting that p̃ i,z = m i,z c̃ i,z with c̃ i,z = P i,z / ν i,z , the total sales of a firm serving segment z ∈  k of market i can be calculated as

_ μ i,z ν ( i ν,z i,z ) P i,z  z ( _ μ i,z ν ( i ν,z i,z ) )  i,z L i . The above expression then implies the following segment-specific profit function:

x ( ν i,z ,P i,z , i,z ,L i ) =

π ( ν i,z ,P i,z , i,z ,L i ) = [ ( μ i,z ( ν i,z ) − 1) / μ i,z ( ν i,z ) ] x ( ν i,z ,P i,z , i,z ,L i ) .

The zero-profit cutoff condition for segment z in market i can, therefore, be expressed as follows:

π ( ν i ⁎,z ,P i,z , i,z ,L i ) = w i f i,z .

The implicit solution to the above equation, ν i ⁎,z , implies a cut-off that depends solely on the characteristics of market i and is independent of the origin country.19 Alternatively, the demand structure specified by Assumption A2, is general enough to admit a choke price, b i,z , such that π i,z = 0 if c̃ i,z ≥ b i,z . So even in the case where f i,z = 0, there exists a cut-off ν i ⁎,z = P i,z / b i,z that is invariant to the characteristics of the origin country.

19The quality cutoff, φ ji ⁎,z , implied by ν i ⁎,z is, however, origin country specific.

Aggregate Outcomes.—Total sales of country j to market i in segment z ∈  k can be calculated as the sum of all firm-level sales as follows:

X ji,z = N j,k ∫

_ ρ z

ρ z −1 ,P i,z , i,z ,L i ) d G j,z (φ) . Noting that (i) d G j,z = γ k  j,z φ − γ

φ x ( P i,z / τ ji,k a z w j φ

k −1 , by Assumption A3, and (ii) ν i,z

_ ρ z

= ( P i,z / τ ji,k a z w j ) φ

1− ρ z , the above equation can be reformulated as

(3) X ji,z = χ i,z  j,z N j,k ( τ ji,k w j ) − γ k (

ρ z −1) ,

_ 1

ν  z (

ν ) ν − γ

∞ ⁎ _ μ i,z (ν)

_ μ i,z (ν)

_ 1− ρ z

ρ z _ dν

where χ i,z ≡ γ k a z −θ z  i,z P i,z ∫ ν

ν .20 Correspondingly, the aggregate segment-level profits collected from sales to market i is given by

k

i,z

Π ji,z = N j,k ∫

_ ρ z

φ π ( P i,z / τ ji,k a z w j φ

ρ z −1 ,P i,z , i,z ,L i ) d G i,z (φ)

= ζ i,z χ i,z  j,z N j,k ( τ ji,k w j ) − γ k (

ρ z −1) ,

_ 1

where

ζ i,z ≡

∫

(

ν )  z (

ν ) ν − γ

∞ _ μ i,z (ν) − 1

_ μ i,z (ν)

_ 1− ρ z

ρ z _ dν

ν

k

ν i ⁎,z

μ i,z (ν)

.

__________________________________

∫

(

ν )  z (

ν ) ν − γ

∞

_ μ i,z (ν)

_ 1− ρ z

ρ z _ dν

ν

k

ν i ⁎,z

The above equations imply that the share of country j 's profits, Π j,k , from total sales,  j,k , in industry k is equal to ζ– j,k ≡ Π j,k /  j,k = ( ∑ i,z∈ 

X ji,z ) . Note that if we were also to assume that ν i ⁎,z is uniform across markets à la Arkolakis et al. (2019), then ζ i,z will be constant and uniform within segment z and profits will be a constant share of total revenue.

ζ i,z X ji,z ) / ( ∑ i,z∈ 

k

Summary of Equilibrium.—A trade equilibrium is an  ×  vector of entrants N ≡ [ N i,k ] , an  × 1 vector of national wage rates, w ≡ [ w i ] , an  ×  vector of industry-level employment, L ≡ [ L i,k ] , an  × 1 vector of income levels, y ≡ [ y i ] , and an N × K ×  vector of market-specific cut–offs, ν ⁎ ≡ [ ν i ⁎,z ] , that satisfy (i) the free entry condition (FE), (ii) the industry and aggregate-level

20The term θ z ≡ γ k (1/ ρ z − 1) denotes the trade elasticity, which I will elaborate on later. Also, in the integrals, the subscript for ν is dropped in the interest of brevity.

labor market clearing conditions (LMC), (iii) the representative consumers budget constraint, and (vi) the zero-profit cutoff condition (ZPC):21

(FE) N j,k = ζ– j,k L j,k / f ke ,



(industry LMC) w j L j,k = ∑

#### ∑

X ji,z + N ji,z w j f j,z − N ij,z w i f i,z , (aggregate LMC) ∑

i=1

z∈  k



L j,k = L j ,

k=1





(BC) y i L i = w i L i = ∑

#### ∑

X in,z ,

n=1

k=1

z∈  k

(ZPC) π ( ν i ⁎,z ,P i,z , i,z ,L i ) = w i f i,z ,

where N ji,z ≡ N j,k (1 −  ji,z ( ν i ⁎,z ) ) denotes the mass of country j firms that can profitably serve market i in segment z ∈  k .22 In constructing the equilibrium, note that given the markup equation, the vector of wages, w = y , fully determines the income levels, the entire schedule of prices in each market, as well as P z ( p ̃ ,y) and  z ( p ̃ ,y) . Also, given that the Marshallian demand must satisfy the budget constraint (BC) of the representative consumer, trade is necessarily balanced, i.e., ∑ n ∑ k ∑ z∈ 

X in,z = ∑ j ∑ k ∑ z∈ 

X ji,z for every country i.

k

Gravity Equation.—An attractive feature of the present framework is that country-level export flows within each segment are characterized by a gravity equation. Specifically, let α i,k denote country i 's expenditure share on industry k , and e i,z denote country i 's (within-industry) expenditure share on segment z . Equation (3) then implies that total export flows from country j to i within segment z of industry k are given by

 j,z N j,k ( τ ji,k w j ) γ k (1−

__1 ρ z )

(4) X ji,z =

e i,z α i,k Y i ,

_________________________

∑ n =1  n,z N n,k ( τ ni,k w n ) γ k (1− __ ρ1 z )

where Y i ≡ w i L i denotes total income in country i . The trade elasticity underlying the above gravity equation, θ z ≡ γ k (1 − 1/ ρ z ) , depends on both (i) the industry-level degree of firm heterogeneity, γ k , and (ii) the segment-level degree of quality intensity, ρ z . Only in the case of exact isomorphism between quality and quantity (which amounts to setting ρ z = 1/ 2, ∀ z ), the trade elasticity becomes identical to the Pareto shape parameter, γ k , as in Chaney (2008).

21The existence and uniqueness of equilibrium in the present setup is nontrivial. But, for a given vector ν ⁎ , the present model falls under the class of multi-industry gravity models studied in Kucheryavyy, Lyn, and RodríguezClare (2016); and satisfies the necessary condition for uniqueness and existence of equilibrium in these modelssee Propositions 2–6 in Kucheryavyy, Lyn, and Rodríguez-Clare (2016).
22d  ji,z (ν) ≡ d G j,z ( τ ji,k a z w j ν 1/ ρ

z −1 / P i,z ) denotes the PDF of ν for country j firms selling to market i in segment z .

B. A Special Case: CES without q-φ Isomorphism

To put the above model in perspective, it is useful to outline a special case of the model with familiar CES preferences. This special case is essentially a variation of Chaney (2008) that relaxes the exact isomorphism between quality and quantity. In this special case, the utility of the representative consumer has the following parametric structure:

_ ε z ε z −1 e i,z α i,k

z∈  k ( ∫

ε z dω)

ε z q ω 1−

K

W i = ∏ k=1

ω∈ Ω i,z φ ω

__ 1

_ 1

∏

,

where α i,k denotes the share of market i 's expenditure on industry k , while e i,z denotes the ( within-industry) share of expenditure on segment z , with ∑ k α i,k = 1 and ∑ z∈ 

e i,z = 1. Based on the above utility, ρ z = 1/ ε z , and the demand function facing firm ω is q̃ ω = ( p̃ ω

k

z dω) e– i,z y i , with the effective quantities and prices defined as q̃ ω ≡ q ω φ ω 1/ ( ε

− ε z / ∑ k ∫ ω′ p̃ ω′ 1− ε

z −1) . One can verify, either directly or by using equation (4), that the gravity equation in this special case adopts the following formulation:

z −1) and p̃ ω ≡ p ω / φ ω 1/ ( ε

________________________  j,z N j,k ( τ ji,k w j ) γ k (1− ε z ) ∑ n =1  n,z N n,k ( τ ni,k w n ) γ k (1− ε z )

e i,z α i,k Y i .

X ji,z =

Based on the above gravity equation, the trade elasticity, θ z = γ k (1 − ε z ) , depends on both the firm-level demand elasticity, ε z , which governs the degree of quality intensity, and the Pareto shape parameter, γ k . In other words, the model predicts a strong link between the degree of firm-level and country-level market powers, which, as I will argue later, has sharp macro-level implications.

C. The Structure of Within-Industry Specialization

The present framework offers a new perspective on the structure ofwithin-industry specialization. As a first step in demonstrating this, I use the gravity equation to determine the composition of country-level exports across segments of an industry. Following equation (4), the relative exports of countries j and n to market i across segments z and z′ ∈  k are given by

________ X ji,z / X ji,z′ X ni,z / X ni,z′

= (  ________ j,z /  j,z′

 n,z /  n,z′ ) ( _ τ ji,k w j

γ k ( __ ρ 1

z )

− _ ρ 1

τ ni,k w n )

(5)

.

z′

Based on equation (5), revealed comparative advantage across segments of an industry is governed by (i) technical comparative advantage through  j,z  n,z′ /  j,z′  n,z , as well as (ii) differences in trade costs, τ, and labor cost, w . The latter margin yields a clear pattern of specialization, which is outlined below.

LEMMA 1 ( ρ -Driven Specialization): Absent technical comparative advantage

(i.e.,  j,z =  j,k  z ), high-w and high- τ economies have a revealed comparative advantage in high- ρ ( quality-intensive) segments of industries.

Lemma 1 is a reduced-form result that follows directly from equation (5), relating a country's export composition to its wage level and geo-location, which are both observable. While Lemma 1 is agnostic to the source of cross-national wage differences, it requires the segment-neutrality of  's, a sufficient condition to rule out cases where low-w or low- τ economies have a technical comparative advantage (i.e., a higher relative ) in high- ρ segments. To the extent that Lemma 1 outlines a pattern of specialization across industry segments with different trade elasticities, θ z ≡ γ k (1 − 1/ ρ z ) , it resembles the theory in Fieler (2011). The present model, however, identifies a novel source of variation in the trade elasticities that operates through ρ z , and that is absent in Fieler (2011) or prior gravity models. Accordingly, specialization across low- and high-trade elasticity segments, in the present model, assumes a different interpretation compared to Fieler (2011).

The next step is to determine how the pattern of specialization outlined by Lemma 1 dictates the structure of export market power and markups. This transition is guided by an important feature of the model where even though markups are variable, the distribution of firm-level markups is invariant to the characteristics of the exporting country. To demonstrate this, let  ji,z (m;w j ,τ ji,k ) = Pr{ μ z ( ν ) ≤ m ∣ ν ≥ ν ⁎ i,z } denote the distribution of firm-level export markups from country j to i in segment z . Appendix C demonstrates that  ji,z is invariant to the exporting country's characteristics, τ ji,k and w j , and is given by

 ji,z (m;w j ,τ ji,k ) =  i,z (m) = 1 − ( ν i,z

⁎ μ i −,z1 (m) )

− γ k ( _ ρ 1

z

−1) .

The distribution of export markups, however, varies systematically across segments and also with the characteristics of the importing country i . The across-market variation in markups is a reflection of pricing-to-market, as in Simonovska (2015). The following lemma summarizes the result concerning the invariance of markup distribution across exporters.

LEMMA 2 (Within-Segment Markup Distribution): The distribution of export markups within segment z is invariant to the exporting country's characteristics.

The intuition behind Lemma 2 is similar to that provided by Arkolakis et al. (2019), which is based on the countervailing effects of higher costs on markups. While lower transport costs, τ ji,k , or labor costs, w j , induce firms from country j to charge higher markups in market i , they also induce lower quality firms from country j to export, and such firms charge lower markups. These two effects exactly offset one another in the Pareto case, leading to the invariance of the markup distribution. Importantly, Lemma 2 indicates that in the extensive class of models covered here (e.g., Melitz and Ottaviano 2008 and Simonovska 2015), variable markups do not create a systematic link between country-level characteristics and the

average export markup within segments. Instead, the industry-level heterogeneity in export markup levels is driven exclusively by the composition of exports across segments.

Lemma 2 points to two important corollaries. First, it entails that the distribution of demand elasticities facing firms exporting from country j to market i is also invariant to country j 's characteristics:  ji,z (m;w j ,τ ji,k ) =  i,z (ε) . Given the invariance of the markup and demand elasticity distributions, I henceforth use ε– i,z and m– i,z to denote the average demand elasticity and markup level associated with segment z in market i :

ε– i,z ≡ ∫

ε εd  i,z (ε) ,

m– i,z ≡ ∫

m md  i,z (m) .

Correspondingly, in each market, we can uniquely rank segments within industry k by their average firm-level markup or elasticity level. Such a ranking is also significant from a macro-level perspective given that, under free entry, m– i,z represents the degree of scale economies in segment z . To elaborate, consider the standard definition of the scale elasticity, which is the elasticity at which firm-level output increases with total input cost: ψ ≡ ∂ lnq / ∂ ln . It is a well-known that if (i) firms are profit-maximizing and (ii) there is free entry ( AC = p ), this elasticity equals the firm-level markup (see Hanoch 1975 and Helpman 1984). In particular,

scale elasticity: ψ ≡ _ AC MC =

_ p MC = _ ε

ε − 1 = m ,

where MC and AC , respectively, denote the marginal cost and average cost at the firm level. Considering the above equation, high-m– or low-ε– segments can be viewed as high-scale intensive. As one may expect, this connection has basic implications for the gains from trade and cross-country income differences—a point I will return to in Section IV.

The final step in establishing my theoretical proposition is to relate ρ-driven specialization (Lemma 1) to the markup and scale-intensity of a nation's exports. To this end, I impose the following testable assumption that links the segment-level quality intensity, ρ z , to its average demand elasticity, ε– i,z .

ASSUMPTION A4 (Relation between ε– and ρ): Firms on average face a lower demand elasticity, ε– i,z , in high-quality-intensive (high-ρ z ) segments of an industry.

The above assumption is already implicit in the CES case outlined in Section IIB, where ρ z = 1/ ε z . Beyond CES, the prior literature has occasionally invoked Assumption A4, with Rodrik (1994), for instance, arguing that quality-intensive segments are less standardized and therefore less price sensitive. But more importantly,

Assumption A4 is empirically testable. In Section III, I employ rich micro-level data to establish that the empirical relationship between quality intensity and the demand elasticity aligns with Assumption A4.

Assumption A4 paired with Lemmas 1 and 2 imply my main proposition that high-income and geographically remote economies specialize in high-markup, high-scale-intensive segments of industries. Appendix C provides a formal proof of this claim, and the following provides a formal statement.

PROPOSITION 1: Absent technical comparative advantage, high-wage and geographically remote economies specialize in high-scale intensive, high-markup segments of industries.

The intuition behind Proposition 1 is straightforward. National industries that face high labor or transportation costs tend to export relatively more in quality-intensive (low trade elasticity) segments were aggregate sales are less sensitive to aggregate cost measures. Per Assumption A4, high quality-intensive segments are also those in which firms, as an individual unit, enjoy greater market power and set higher markups. Altogether, these two patterns yield the proposition stated above. To better understand Proposition 1, bear in mind that higher wage levels in the present model are a reflection of the economy being agglomerated with high-quality firms (which will be reflected in a higher  ). Proposition 1, merely states that an agglomeration of high-quality firms in an economy puts upward pressure on the local wage, inducing firms to specialize in high-market power, high-scale-intensive segments of industries.

Discussion.—Assumptions A1–A4 outline a set of sufficient conditions that deliver Proposition 1. Of these, Assumption A1 is less consequential. It relaxes the quality-quantity isomorphism in favor of a less-restrictive assumption to identify a new source of variation in the trade elasticities. Assumptions A2 and A3 are conventional assumptions that deliver the gravity equation—as noted earlier, both assumptions underlie an important class of constant and variable markup trade models. Assumption A4, however, is a major assumption that can be tested with micro-level data, and the following section is dedicated to this task.

Note, however, that even a weaker version of Assumption A4 will still yield Proposition 1. Specifically, as long as the segment-wide trade elasticity θ z ≡ γ k (1 − 1/ ρ z ) is positively correlated with the average firm-level demand elasticity, ε– i,z , the pattern of revealed comparative advantage outlined by Lemma 1, will yield Proposition 1. The same applies even if the Pareto shape parameter, γ, is segment-specific.

Finally, the specialization pattern outlined by Proposition 1 can concurrently occur at various levels of aggregation. In the interest of consistency, the theory was cast here as one of within industry specialization. But one can alternatively assume that A1–A4 hold across (rather than within) industries to arrive at a similar proposition regarding across-industry specialization. In that case, high-wage and remote economies specialize in high-market power, high-scale-intensive industries.

D. An Alternative View on Export Price Levels

Proposition 1 offers a new perspective on the previously highlighted facts (i) and (ii): that export price levels increase with distance and the income per capita of the exporting country. As noted before, the dominant view in the literature is that these facts are driven by quality specialization. Against the backdrop of this dominant view, Proposition 1 highlights the markup channel as an alternative driver of the aforementioned facts, while Lemma 1 provides an alternative view on the origins of quality specialization.

To elaborate, note that the average free on board price of exports from country j to market i in industry k can be decomposed as follows:

p– ji,k = μ – ( τ ji,k ,w j ) markup-driven

× a– ( τ ji,k ,w j ) w j

,



cost-driven

where a– ( τ ji,k ,w j ) ≡ ∑ z∈ 

m– i,z r ji,z ( τ ji,k ,w j ) , respectively, denote the average markup and unit labor cost embedded in country j 's industry-level exports to markets i , with r ji,z ≡ X ji,z / ∑ z∈ 

a z r ji,z ( τ ji,k ,w j ) and μ– ( τ ji,k ,w j ) ≡ ∑ z∈ 

k

X ji,z being the share of segment z from total industry-level export sales. In the above expression, the markup-driven competent, μ– (τ,w) , has a clear connotation. The cost-driven term, a– (τ,w) w , meanwhile, primarily accounts for quality differences. In particular, a– (τ,w )w may be higher due to (i) a higher wage, which reflects an agglomeration of high-quality firms in the economy, or (ii) a higher a– ( · ) , which reflects a higher share of exports in high-quality-intensive segments, provided that az is increasing in the degree of quality intensity, ρ z .

k

From the lens of the present model, the markup-driven term, alone, can explain why exports from rich and distant suppliers exhibit higher price levels. In particular, Proposition 1 asserts that ∂ 2 r ji,z / ∂ τ ji,k ∂ m– i,z > 0 and ∂ 2 r ji,z / ∂ w j ∂ m– i,z > 0, which along with the fact that ∑ z∈ 

r ji,z = 1, imply the following:

k

__________ ∂ μ– ( τ ji,k ,w j ) ∂ τ ji,k

_____ ∂ r ji,z ∂ τ ji,k

#### = ∑

0,

(6)

m– i,z

z∈  k

__________ ∂ μ– ( τ ji,k ,w j ) ∂ w j

____ ∂ r ji,z ∂ w j

#### = ∑

0.

m– i,z

z∈  k

Based on the above, the "Washington apples" effect, ∂ p– ji,k / ∂ τ ji,k > 0, is partly driven by distant exporters sorting into high-markup segments; or the observation

that ∂ p– ji,k / ∂ w j > 0 is reflective of high-income economies exporting relatively more goods pertaining to high-markup segments of industries. Even though this mechanism concerns export markups, it operates orthogonal to the well-known variable markup mechanism. As implied by Lemma 2, markups can be variable across firms, but the distribution of markups, within segments, is the same for all exporting countries. That being the case, variable markups can explain why Australian firms set higher prices in the US market than in China, but not why the

US imports higher price goods from Australia than from, say, Canada. The above mechanism, by contrast, explains this exact type of variation.

Finally, in the present model, the quality margin also contributes to the positive

relationship between τ ji,k , w j , and p– ji,k , but through a mechanism that is distinct from those specified in the prior literature. Specifically, provided that high-

quality-intensive (high- ρ) segments involve higher productions cost, a z , Lemma 1 implies that ∂ a– (τ,w) / ∂ τ > 0 and ∂ a– (τ,w) / ∂ w > 0. That is, absent markup heterogeneity, a higher τ ji,k or w j increases the aggregate export price level, p– ji,k , by inducing specialization in quality-intensive segments.

III. Micro-Level Evidence

Even if the full schedule of export markups were observable, testing Proposition 1 directly would remain challenging as it requires information on the full vector of deep technology parameters, A ≡ [  i,z ] . To elaborate, suppose the data suggests that high-wage and remote economies export relatively more in high-markup segments. It may very well be the case that they have a factor-driven or a historical scale-driven comparative advantage in these segments, which will be reflected in a higher  . Alternatively, high-wage and remote economies may have a fundamental comparative disadvantage in high-markup segments, which will negate the mechanism highlighted in this paper.

Despite such complications, one can still indirectly test Proposition 1 by evaluating the sufficient conditions that lead to it. As noted earlier, of the assumptions leading to Proposition 1, Assumption A1 is not consequential; Assumptions A2 and A3 are standard assumptions that deliver the gravity equation (see Arkolakis et al. 2019); but Assumption A4 is a unique assumption that is empirically testable. Considering this, I first conduct a formal test of Assumption A4 in this section. I then present direct evidence that factual export patterns do in fact comply with Proposition 1. Finally, I discuss the limitations of my analysis and provide auxiliary evidence on across-segment specialization.

A. Data Description

The empirical analysis that follows uses import transactions data from the Colombian Customs Office for the 2007–2013 period.23 The data covers the universe of all import transactions in Colombia, reporting the following information per transaction: date of transaction, Harmonized System 10-digit product category (HS10), importing and exporting firm's IDs,24 free on board values of shipments in US dollars, quantity and the unit it is measured in, freight in US dollars, insurance

23The data is obtained from Datamyne, a company that specializes in documenting import and export transactions in the Americas. For more detail, please see www.datamyne.com.
24The identification of the Colombian importing firms is standardized by the national tax ID number. For the foreign exporting firms, the data provide the name of the firm, phone number, and address. The names of the firms are not standardized, and thus there are instances in which the name of a firm and its address are recorded differently (e.g., using abbreviations, capital and lower-case letters, dashes, etc.). To deal with this problem, I follow the guidelines in Lashkaripour and Lugovskyy (2019), which involves standardizing the spelling and the length of the names along with utilizing the data on firms' phone numbers.

in US dollars, import tariff and value-added tax rates, and country of origin. The online Appendix (Table 1) reports a summary of basic trade statistics for this dataset. A possible challenge when working with this dataset is that Colombia has been changing the HS10 classification for some products between 2007 and 2013. Fortunately, the Colombian Statistical Agency, DANE, has kept track of these changes, permitting the concordance of Colombian HS10 codes over time—the concordance procedure closely follows the guidelines in Lashkaripour and Lugovskyy(2019), with changes in HS10 codes affecting less than 0.1 percent of the observations between 2007 and 2013. My analysis also requires (i) data on monthly average exchange rates, which are taken from the Bank of Canada; (ii) data on national accounts that are taken from the Penn World Table version 9; and (iii) data on geo-distance, which are taken from the Cepii database (Mayer and Zignago 2011).25

To map theory to data, I take a similar approach to Holmes and Stevens (2014), using the 4-digit Standard Industrial Classification (SIC) to define industries and the 10-digit HS10 product classification to specify segments within an industry. To fix, consider the following group of HS10 product segments: "passenger cars with 1000–1500cc engines," "passenger cars with larger than 3000cc engines," "limousines," "tractors," "golf carts," and so on. Based on the Standard Industrial Classification, these segments are grouped together into SIC 3711, which corresponds to "motor vehicles and passenger car bodies." The analysis that follows concerns specialization across these narrowly defined HS10 segments that are grouped into broadly defined SIC industries.

B. Estimating the Segment-Level Demand Elasticity: ε– z For every variety ω,z t exported to Colombia by firm ω , in HS10 segment z , in

year t , I observe export quantity, q ω,z t , as well as the free on board value plus transportation costs and tariff/tax charges, which can be used to construct the consumer

price, p ω,z t , associated with that variety. Given data on firm-level quantities and prices, I estimate the average demand elasticity facing firms in segment z using the following estimating equation:

(7) lnq ω,z t = − ε– z lnp ω,z t + δ ωz + δ z t + ξ ω,z t ,

where δ ω z and δ z t control for firm-product and product-year fixed effects, and ξ ω,z t represent non-price-demand shifters such as measurement errors or time-varying

components of quality, φ ω,z t .

To eliminate the firm-product fixed effects, I take first-differences from equation (7) to arrive at the following estimating equation:

(8) Δ lnq ω,z t = − ε– z Δ lnp ω,z t + Δδ zt + Δ ξ ω,z t .

25See http://www.bankofcanada.ca/rates/exchange/monthly-average-lookup/.

Table 2—Summary of the Estimated Segment-Level Elasticities

First quartile

Thrid quartile

Number of sig. estimates

Mean Median

Panel A. Segment-level demand elasticity: ε– z OLS 1.13 0.90 0.61 1.23 4,391

(0.11) (0.25) (0.16)

2SLS 2.74 1.88 1.20 3.03 1,283

(1.22) (0.68) (2.61)

LIML 2.78 1.93 1.21 3.26 1,627 (1.26) (0.17) (1.89)

Panel B. Segment-level trade elasticity: θ z ≡ (1/ ρ z − 1) γ k OLS 1.05 0.76 0.42 1.28 2,706

(0.14) (0.28) (0.36) 2SLS 2.24 1.69 1.05 2.75 1,827 (0.77) (0.11) (1.88) LIML 2.58 1.89 1.16 3.21 1,701 (0.50) (0.69) (1.48)

The first-difference estimator exhibits a couple of attractive features for the purpose of my analysis: (i) it eliminates observations pertaining to one-time exporters, (ii) it is more efficient than the fixed effect estimator when ξ ω,k t s are sequentially correlated, and (iii) it requires a weaker orthogonality condition, 피 [Δ lnp ω,z t ∣ Δ ξ ω,z t ] = 0, than the fixed effect estimator.

As a first step, I separately estimate equation (8) for 5,669 HS10 segments using the first-difference OLS estimator. For 4,391 of these HS10 segments, the estimation produces a positive elasticity, ε– z , (i.e., a negative coefficient on unit price) with a robust t-statistic that is greater than one.26 A basic summary of the OLS-estimated demand elasticities is reported in the first row of Table 2.

IV Estimation.—A general concern when estimating equation (8) is that demand

shocks, Δ ξ ω,z t , may be correlated with longitudinal changes in price, Δ lnp ω,z t . Such a correlation, which may be driven by measurement errors or by quality upgrading,

leads to a violation of the orthogonality condition, 피 [Δ lnp ω,z t ∣ Δ ξ ω,z t ] ≠ 0. The standard approach to handling such a concern is to instrument for Δ lnp ω,z t with a plausibly exogenous cost shifter that is orthogonal to Δ ξ ω,z t . Changes in import tariff rates are often considered to be an appropriate instrument in these circumstances. But since equation (8) is being estimated at the firm level, import tariffs do not qualify as a strong instrument, as they exhibit little-to-no variation across firms from the same country.

To overcome the identification challenge, I use the methodology in

Lashkaripour and Lugovskyy (2019), which is to instrument for Δ lnp ω,z t using a plausibly exogenous measure of exposure to aggregate exchange rate shocks.27

26Throughout this section, I drop elasticity estimates that exhibit a robust t-statistic below or equal to one.

My analysis, however, produces qualitatively similar results when I drop elasticities that exhibit a robust t-statistic below or equal to two.

27The estimation conducted here differs from Lashkaripour and Lugovskyy (2019) in two basic aspects.

First, Lashkaripour and Lugovskyy (2019) jointly estimate the country-level and firm-level demand elasticities,

This procedure involves compiling an external database on aggregate monthly exchange rates, and interacting the monthly variation in aggregate exchange rates with the monthly composition of firm-level exports in the prior two years. Doing so delivers the following shift-share instrument:

12

 ω,z t = ∑

#### ∑

Δ E jt (mo) × r ω,z t−l (mo) ,

l=1,2

mo=1

with Δ E jt (mo) denoting the change in country j 's exchange rate with Colombia in month mo of year t ; and r ω,z t−l (mo) and denotes the prior share of firm ω's month mo sales in year t − l . To elaborate,  ω,z t measures the variety-specific exposure to exchange rate shocks. It builds on the fact that aggregate exchange rate movements may have differential effects on firms, depending on the monthly composition their export sales. In that regard,  ω,z t resembles the widely used Bartik instrument. Perhaps most importantly,  ω,z t captures a plausibly exogenous source of variation in firm-level costs, provided that aggregate exchange rates and historical export patterns are independent of concurrent demand shocks.

I conduct a battery of tests to diagnose the strength of my price instruments in Table 3. Given that errors may be heteroskedastic, I use the heteroskedasticity-robust Kleibergen and Paap (2006) test statistics to diagnose underidentification and weak instruments. Based on the Kleibergen-Paap LM statistic, I can reject the null hypothesis that the model is unidentified. To test if the instruments are only weakly correlated with unit price, I also report heteroskedasticity-robust Kleibergen and Paap (2006) Wald rk F-statistic. While critical values for the Cragg and Donald (1993) F-statistic have been provided by Stock and Yogo (2002) in the homoskedastic case, there is no tabulation of such values for the heteroskedastic case. So, as is common in the literature, I compare the heteroskedasticity-robust statistic to the Stock and Yogo (2002) critical values, which allows me to reject the null of weak instruments. Finally, Table 3 also reports the Anderson and Rubin (1949) Wald test, which strictly rejects the null that the coefficients on the excluded instruments are jointly zero when they are included in place of unit price, Δ lnp ω,z t , in the estimating equation (8).

The results attained under the 2SLS estimator are reported in the second row of Table 2. Encouragingly, the 2SLS estimates are lower than the OLS estimates, which suggest that the instruments are operating in the right direction. However, as expected, the standard errors corresponding to the 2SLS estimation are higher, so that for only 1,283 HS10 segments the estimated elasticity is positive and exhibits a robust t-statistic greater than one. Given that HS10 segments may feature a small sample of firm-level observations, I also estimate equation (8) using the Limited Information Maximum Likelihood (LIML) estimator, which is more robust than 2SLS in the presence of weak instruments and small sample bias (Stock and Yogo

by assuming a nested-CES demand system. Second, given the focus of that paper, the elasticities are estimated at WIOD industry level of aggregation. Here, I neither impose that the underlying demand is CES nor do I assume that the elasticities are uniform within industries. As a result, I can estimate reduced-form demand elasticities at the HS10 level of aggregation, in order to test Assumption A4. However, unlike Lashkaripour and Lugovskyy (2019), the estimated elasticities no longer assume a structural interpretation and the country-level trade elasticities have to be estimated separately from the firm-level demand elasticities.

Table 3—First-Stage Diagnostics: Pooled Sample

Estimating parameter Demand elasticity ε– z

Trade elasticity θ z

Kleibergen-Paap LM statistic ( under-identification) 373.58 1,589.54 Kleibergen-Paap LM p-value 0.000 0.000

Kleibergen-Paap Wald rk F-statistic (weak instruments) 121.54 494.14 Anderson-Rubin Wald test F-statistic 235.72 2,223.92 Anderson-Rubin Wald test p-value 0.000 0.000 Observations 1,341,480 500,064

2002). The results corresponding to the LIML estimation are summarized in the third row of Table 2 and are qualitatively similar to the 2SLS estimates.

C. Estimating the Segment-Level Quality Intensity: ρ z

Using the gravity equation one can estimate a segment-specific trade elasticity, θ z = γ k (1/ ρ z − 1) , which determines ρ z up to an industry-wide Pareto shape parameter, γ k . There is an extensive literature on estimating the trade elasticity, and what follows builds heavily on this literature. To express the estimating equation, I take logs from equation (3); drop the importer subscript i as I am dealing with data from a unique market; and write the equation in stochastic form. Altogether, country j 's sales-per-firm to the Colombian market in HS10 segment z , in year t , can be stated as

(9) lnX j,z t / N j,k t = − γ k ( _ 1

ρ z − 1) lnτ j,z t w j,t + δ z t + ξ j,z t , z ∈  k ,

where δ z t = lnχ z t can be treated as an HS10-year fixed effect and ξ j,z t is an unobserved error term that, among other things, encompasses measurement errors

or longitudinal variations in national product quality,  j,z t . Based on Head and Mayer's (2014) survey of the literature, there are multiple approaches to estimating the above equation. The first approach, which is more suited to cross-sectional data, is to control for w j using exporter fixed effects and identify the trade elasticity using the variation in transport costs (e.g., Hummels 2001, Hertel et al. 2007) or tariff rates (e.g., Romalis 2007, Caliendo and Parro 2014). An alternative approach for panel data is to control for τ j,z t w j,t using the average unit price of country-level exports, p– j,z t = m– z t a z t τ j,z t w j,t (e.g., Erkel-Rousse and Mirza 2002, Broda and Weinstein 2006); and to instrument for p– j,z t in order to address the possible endogeneity concern. Given the panel structure of my data, I adopt the latter approach.

To estimate equation (9), all firm-level variables are aggregated up to construct country-level variables. This procedure leaves me with 500,064 observations with the unit of observation being exporting country–HS10 segment–year. For each observation, I have the average sales per firm X j,z t / N j,k t and the average

price, p– j,z t = m– z t a z t τ j,z t w j,t , associated with these sales. Using these variables, I first estimate equation (9) using an OLS estimator, with a summary of the estimation results reported in the fourth row of Table 2.28 The results indicate that for 2,706 HS10 segments, the estimated trade elasticity is positive and exhibits a robust t-statistic greater than one. Perhaps expectedly, the median trade elasticity is also lower than the median demand elasticity facing individual firms.

As with the firm-level estimation, one may be concerned that the orthogonality condition, 피 [ p– j,z t ∣ ξ j,z t ] = 0, is violated due to measurement error or longitudinal changes in the national quality distribution,  j,z t . To address these concerns, I also estimate equation (9) using a 2SLS estimator, where I instrument for p– j,z t with the exporter-specific tariff and value-added tax rates. Here, the identifying assumption, as in Caliendo and Parro (2014), is that import tax rates are plausibly exogenous to ξ j,z t . Correspondingly, the second column in Table 3 reports a set of first-stage diagnosis tests, which reject the null hypothesis of weak instruments or the model being unidentified.

The 2SLS estimates of θ z = γ k (1/ ρ z − 1) are summarized in the fifth row of Table 2. The 2SLS estimates are encouragingly lower than the OLS estimates but exhibit higher standard errors. Overall, the 2SLS-estimated trade elasticities are positive and exhibit a robust t-statistic greater than one for only 1,827 HS10 segments. As before, I also estimate equation (8) using the LIML estimator, which is more robust than the 2SLS estimator in the presence of weak instruments and small sample bias. The results, which are summarized in the last row of Table 2, are qualitatively similar to the 2SLS case. Generally speaking, my trade elasticity estimates are slightly lower than those produced by Caliendo and Parro (2014) and Simonovska and Waugh (2011). These differences may stem from the fact that (i) I explicitly control for the number of firms, N j,k t , instead of treating countries as one integrated entity; and (ii) I estimate the trade elasticity at a more refined level of aggregation, which circumvents some of the aggregation biases highlighted in Imbs and Mejean (2015).

Before employing the estimated elasticities to test my theory, let me briefly discuss the plausibility of my estimates. In the online Appendix (Table 2), I report the average estimated demand elasticity and quality intensity for a range of broadly defined industries. The quality intensity is simply measured as ρ z = 1/ (1 + θ z / γ k ) , where γ k is assigned a value of 2.46 based on the estimates of Eaton et al. (2010). Based on my estimates, the "Machinery" and "Electrical and Optical Equipment" sectors are comprised of the most quality-intensive HS10 segments, while the "Food" and "Paper" sectors are comprised of the least quality-intensive HS10 segments. These results sit well with evidence from consumer surveys that durables are more quality intensive than nondurables (Fornell et al. 1996). Similarly, the "Machinery" and "Electrical and Optical Equipment" sectors are comprised of the least price-elastic segments, while the "Food" and "Agriculture and Mining" sectors include the most price-elastic HS10 segments. These rankings align well with the classification in Campa and Goldberg (1995).

− 1) ln p– j,z t + δ̃ z t + ξ j,z t .

28Stated formally, the estimating equation is lnX j,z t / N j,k t = − γ k ( _ ρ 1

z

Table 4—The Relationship between Quality-Intensity and Demand Elasticity (Assumption A4) Dependent: trade elasticity Dependent: quality intensity

Regressor OLS 2SLS LIML OLS 2SLS LIML Demand elasticity, logε– z 0.346 0.250 0.177 −0.102 −0.115 −0.084

(0.029) (0.051) (0.046) (0.007) (0.023) (0.021)

Observations 1,851 435 516 1,851 435 516 Number of SIC Industry FE 95 52 63 95 52 63 R 2 0.45 0.28 0.20 0.39 0.27 0.20

D. Testing Assumption A4: The Link between ρ z and ε– z

The segment-level trade elasticity, θ z = (1/ ρ z − 1) γ k , identifies the segment-level quality intensity up to an industry-wide Pareto shape parameter, γ k . Hence, to test Assumption A4, I can use the estimated HS10-level trade and demand elasticities ( θ z and ε– z ), to run the following regression:29

(10) ln(1/ ρ z − 1) γ k = β ⋅ ln ε– z + δ k + ξ z , z ∈  k

where δ k controls for SIC industry fixed effects. I estimate the above relationship using the elasticities attained under all estimators—namely, OLS, 2SLS, and LIML. As reported in Table 4, in all incidences the segment-level trade elasticity is positively correlated with the average demand elasticity, ε– z , facing individual firms in that segment, with the relationship being statistically significant at the 99 percent confidence level. This positive relationship assures that high-quality-intensive (high-ρ z ) segments of the industry exhibit a lower demand elasticity, which is the assertion of Assumption A4.

When running regression (10), the value of γ k is irrelevant as it is absorbed by the industry fixed effect. However, I can alternatively test Assumption A4 by calculating ρ z = 1/ (1 + θ z / γ k ) based on γ k = 2.46 from Eaton, Kortum, and Kramarz (2011), and running the following regression instead:

lnρ z = β ⋅ lnε– z + δ k + ξ z , z ∈  k .

The estimation results corresponding to the above regression are reported in the right panel of Table 4. Expectedly, this approach delivers a similar set of results, pointing to a strong, negative relationship between quality-intensity and demand elasticity across segments, which again corroborates Assumption A4.

As a clarifying point, note that the number of observations in Table 4 varies across columns because each of the OLS, 2SLS, and LIML estimators produces a different set of plausible and statistically significant trade and demand elasticity estimates.

29Assumption A4 only requires that ρ z and ε– z be positively related. The parametric form of the relationship is inconsequential. Nonetheless, the online Appendix (Table 4) also reports the relationship between ρ z and ε– z in levels (rather than in logs), which yields similar outcomes.

For 1,851 HS10 segments (spanning 95 SIC industries) the OLS estimation delivers a statistically significant estimate of both ε– z and θ z , which can be used to estimate equation (10). The 2SLS and LIML estimators, while conceivably more reliable, produce statistically significant trade and demand elasticities for a fewer number of SIC industries and HS10 segments.30

E. The Composition of Industry-Level Exports

The previous section presented empirical support for Assumption A4, as the key condition that prompts across-segment specialization in Proposition 1. Now, I turn to directly testing the outcomes predicted by Proposition 1 and by Lemma 1. To test Lemma 1, I use my trade elasticity estimates to calculate the degree of quality intensity, ρ z , per HS10 segment in order to construct the following measure:

__________ ρ z X i,z t ∑ z∈ 

∑ z∈ 

ρ– j,k t =

.

k

X i,z t

k

To be more specific, ρ– j,k t measures the average quality intensity of country j 's exports to Colombia in SIC industry k . Lemma 1 predicts that ρ– j,k t is increasing in country j 's income per worker, w j,t , and its distance to Colombia, Dist j ; which I can test by running the following regression:

lnρ– j,k t = β 1 lnw j,t + β 2 lnDist j + β 3 lnY j,t + δ k t + ξ j,k t .

In the above regression, δ k t accounts for industry-year fixed effects, and Y j,t controls for country j 's total GDP. The results corresponding to this regression are reported in the left panel of Table 5, and confirm the prediction that high-wage and distant economies export relatively more in high-quality-intensive segments of each industry.

Similarly, to test Proposition 1, I use my estimated HS10-level demand elasticities to construct the following measure:

 ̅ j,k t =

___________ X i,z t / ε– z ∑ z∈ 

∑ z∈ 

,

k

X i,z t

k

which reflects the average market power of firms' exporting from country j to Colombia in SIC industry k . Proposition 1 predicts that  ̅ j,kt is increasing in country j 's income per worker, w j,t , and its distance to Colombia, Dist j , which I can test by running the following regression:

ln  ̅ j,k t = β 1 lnw j,t + β 2 lnDist j + β 3 lnY j,t + δ k t + ξ j,k t .

30Industries that produce significant 2SLS and LIML estimates are typically those with significantly more observations. This is important to note as one may be concerned with the issue of endogenous selection.

Table 5—The Composition of Country-Level Exports to Colombia Average quality intensity Average market power

Regressor OLS 2SLS LIML OLS 2SLS LIML lnw j 0.005 0.001 0.004 0.013 0.021 0.031

(0.001) (0.002) (0.002) (0.002) (0.004) (0.004)

lnDistj 0.014 0.023 0.027 0.017 0.029 0.028

(0.001) (0.002) (0.002) (0.003) (0.004) (0.005)

lnY j −0.005 −0.005 −0.006 −0.010 −0.019 −0.028 (0.001) (0.001) (0.001) (0.001) (0.002) (0.003)

Observations 16,207 15,582 15,353 23,095 14,896 16,555 R 2 0.59 0.59 0.56 0.61 0.55 0.50

As before, δ k t accounts for industry-year fixed effects and Y j,t controls for county j 's total GDP. The results corresponding to this regression are reported in the right panel of Table 5, and again decisively confirm the prediction that high-wage and distant economies export relatively more in high-market power segments of each industry. While these results are encouraging, one should exercise caution when interpreting them. Specifically, as noted earlier, the findings presented in Table 5 may be an artifact of factor-driven specialization. That is, it can be the case that distant and high-wage economies have a factor-driven comparative advantage in low- θ and low-ε segments, which reinforces the mechanism highlighted in this paper.

F. Discussion and Auxiliary Evidence

Here, I will briefly discuss the extensions and limitations of my analysis. Before doing so, let me take stock. This section estimated two key elasticities for a range of HS10 segments: (i) the trade elasticity θ z , which reflects the collective market power of a country in segment z ; and (ii) the firm-level demand elasticity, ε– z , which reflects the average market power of individual firms in that segment. Following Chaney (2008), it has been widely believed that the link between θ z and ε– z is theoretically unclear. The results in this section, however, established a positive empirical link between θ z and ε– z . This relationship plus trade flows being governed by a gravity equation suffice to prompt specialization based on the mechanism introduced in this paper.

The assumption that γ k does not vary across segments is not consequential to the conclusions in this section. More specifically, the across segment variation in the trade elasticity, θ z , may very well reflect across-segment heterogeneity in γ . But this does not cast a problem for my analysis, as Proposition 1 holds so long as θ z is positively correlated with ε– z across segments and irrespective of what triggers the variation in θ z .

Looking closer at the evidence, the theory developed here can also explain across-industry and within-segment specialization. Specifically, as shown in the online Appendix (Table 3), the negative relationship between ρ z and ε– z prevails not only within industries but also across industries. Per Proposition 1, such a relationship will prompt specialization across low- and high-markup (or low- and

Table 6—Evidence on Within-Segment Specialization

Estimator Regressor LIML 2SLS OLS Δlnp ω,z t (− ε c ) −5.509 −3.787 −0.702

(0.733) (0.355) (0.005)

Δlnp ω,z t × lnDist j (− ε τ ) 4.456 3.001 −0.109

(1.246) (0.591) (0.009)

Δlnp ω,z t × lnw j (− ε w ) 16.586 9.804 0.562

(2.018) (0.834) (0.012)

Kleibergen-Paap LM statistic 235.4 ... Kleibergen-Paap LM p-value 0.000 ... Kleibergen Paap Wald rk F-statistic 25.32 ... Anderson-Rubin Wald F-statistic 82.22 ... Anderson-Rubin Wald p-value 0.000 ...

Observations 1,279,875

high-scale-intensive) industries. Moreover, a similar pattern of specialization may occur within HS10 segments. In fact, I find basic evidence that high-wage and distant economies specialize in high-markup niches of HS10 segments. The evidence is based on the following demand estimation, which admits elasticity heterogeneity across suppliers within an HS10 segment:

Δlnq ω,z t = − ( ε c + ε τ ⋅ lnDist j + ε w ⋅ lnw j ) Δlnp ω,z t + Δδ z t + Δ ξ ω,z t , ω ∈ Ω j,z t . In the above equation, Δ δ z t , as before, controls for HS10-year fixed effects, while Dist j and w j denote the normalized distance and income per worker of country j , which applies to each firm ω exporting from that country. I use the same methodology, explained earlier, to estimate the above equation. The estimation results presented in Table 6 indicate that firm-level exports from high-wage and distant economies face significantly lower demand elasticities, which is indicative of specialization in high-market power niches within HS10 segments.

The obvious limitation of the present analysis is its reliance on the monopolistic competition assumption. This assumption is widely used but difficult to test. Importantly, the link between the average firm-level demand elasticity and the average markup should be amended if the monopolistic competition assumption is violated. That being said, in line with Hottman, Redding, and Weinstein (2016), I find that in most product categories in my sample, the number of exporting firms is large enough to render the monopolistic competition assumption plausible. The other shortcoming, which is due to a lack of better data, is the inability to test Lemma 1 and Proposition 1, while controlling for factor-driven specialization at the firm level.

IV. Macro-Level Implications

Within-industry specialization based on Proposition 1 determines the distribution of economic activity across industry segments that differ in their (i) degree of

market power, (ii) degree of returns to scale, and (iii) trade elasticity. The latter two characteristics determine the extent to which countries can gain from international specialization or trade. So, how national resources are divided along these margins can profoundly influence macroeconomic outcomes. Below, I fit my theoretical model to aggregate trade data to quantify these macro-level implications.

A. Quantitative Strategy

To set the stage for a transparent presentation of macro-level implications, I first describe the quantitative strategy by which the model is fitted to data. The strategy is closely related to a structural gravity estimation à la Anderson and van Wincoop (2003).31 That is, I identify the structural parameters of the model by fitting it to the matrix of aggregate bilateral trade values. However, unlike the standard structural gravity estimation, I allow for industries to be comprised of multiple segments with different trade elasticities.

I estimate the model using two distinct datasets. First, the 2008 version of the World Input Output Database (WIOD), which reports industry-level trade and production data for 32 major economies across 34 industries (15 of which are tradable). Estimating the model with this dataset will allow me to quantify the gains from within-industry specialization. Second, I also estimate the model using country-level trade and production data, which covers 100 economies in the year 2000. Estimating the model with this second dataset allows me to study cross-country TFP differences for a wider range of economies. Appendix D provides a more detailed description of both datasets.

Parametric Restrictions.—The theoretical model presented in Section II only required preferences to be additively separable across industries. But to map the model to data, I need to impose more specific structure on the cross-industry utility aggregator. Considering this, I impose the following restriction, which allows me to estimate the model separately for each industry.

R1. The cross-industry utility aggregator is Cobb-Douglas. That is,

W i = ∏  k=1  α i,k

, where ∑ k α i,k = 1.

i,k

The above restriction is a standard staple in the quantitative trade literature (see Costinot and Rodríguez-Clare 2014). Since the industry-level expenditure shares, α ≡ [ α i,k ] , are observable, Restriction R1 reduces the estimation problem to that of matching an  × ( − 1) matrix of trade values, X k ≡ [ X ji,k ] , separately for each industry k .

To characterize the industry-level trade values, I further impose the following restriction on the demand structure across segments within an industry.

31See Head and Mayer (2014) for a review of the literature on structural gravity estimation.

R2. Each industry k is comprised of two segments,  k = {,} ; with e i,z

denoting the constant share of expenditure on segment z in country i .

Unlike α k , the segment-level expenditure shares, e k ≡ [ e i,z ] , are not observable. So, I treat them as free-moving parameters that are estimated. Given R1 and R2, the

industry-level bilateral trade values, X ji,k , can be calculated as

X ji,k = ∑

λ ji,k e i,z α i,k w i L i

z∈  k

where λ ji,z denotes the share of country i 's expenditure on country j varieties in segment z of industry k . Following the gravity equation (4), derived in Section II,

λ ji,z can be expressed as follows:

__________________  j,k ( w j τ ji,k ) − θ z ∑ n =1  n,k ( w n τ ni,k ) − θ z

(11) λ ji,z =

, z ∈  k .

where θ z = γ k ( ε z − 1) denotes the segment-level trade elasticity, and  j,k ≡  j,k M j,k . The implicit assumption underlying the above equation is that technology parameters are segment-neutral, i.e.,  j,z =  j,k  z . Correspondingly,  z drops out of the gravity equation and  j,k ≡  j,k M j,k can be treated as an industry × exporter fixed effect that is estimated. Considering the above, λ z ≡ [ λ ji,z ] is fully determined as a function of the segment-level trade elasticity, θ z , country-level wages, w ≡ [ wj ] , the vector of exporter fixed effects,  k ≡ [  j,k ] , and the industry-wide matrix of variable trade costs τ k ≡ [ τ ji,k ] .

If one were to treat the elements of τ k as free-moving parameters, the estimation will be overidentified, as the matrix of trade flows, X k = [ X ji,k ] , can be perfectly matched by freely choosing the elements of τ k . So, in line with the literature, I handle the overidentification problem by imposing the following parametric restriction on the bilateral trade costs.

R3. The iceberg trade costs assume the following parametrization:

τ ji,k = β k ⋅ β border,k ⋅ β lang,k ⋅ β agreement,k ( Dist ji ) β dist,k ,

where Dist ji denotes the distance (in thousands of kilometers) between countries j and i . The terms β border , β lang , β agreement are, respectively, "one" if countries j and i do not have a common border, common language, and trade agreement, and an estimated parameter otherwise.

An analogue of the above parameterization, which imposes symmetry in trade costs, underlies many applications including Anderson and van Wincoop (2003),

Fieler (2011), and Caliendo and Parro (2014).32 Based on R3, the matrix of trade costs in industry k is fully characterized by a vector of parameters, β k ≡ { β k ,β border,k ,β lang,k ,β agreement,k ,β dist,k } , plus data on distance, trade agreements, common-language, and borders, which I henceforth refer to as, D . Correspondingly, given equation (11), the matrix of trade shares in segment z ∈  k can be fully calculated as function of w and D , and estimated parameters  k , β k , and θ z ; namely, λ z = λ ( θ z ,β k , k ;w,D) .

Subsequently, the entire matrix of industry-level trade flows can be computed

as a function of free-moving parameters θ  , θ  ,  k , β k , and e k , plus data on wage, population size, geo-political characteristics, and industry-level expenditure shares (namely, w , L , D , α k ) as follows:

(12) X ji,k = X ji,k ( θ  ,θ  ,e k , k ,β k ;α k ,w,L,D)

= ∑

z=  k

 j,k ( w j τ ji,k ( β k ;D) )

− θ z

________________________

∑  n=1  n,k ( w n τ ni,k ( β k ;D) )

− θ z e i,z α i,k w i L i .

The above estimation is subject to a basic identification challenge, which is well known in the structural gravity literature. That is, the effect of both θ  and θ  on trade values cannot be separately identified from that of β k . The standard solution to this problem is to set the trade elasticity to a value, which is externally estimated with auxiliary data. This solution is, however, not fully applicable here, given that the trade elasticity is estimable only up to the same level of aggregation at which multilateral trade flows are reported. To handle this issue, I impose the following restriction that trade elasticities estimated with industry-level data are a trade-weighted average of the segment-level elasticities.

R4. If θˆ k is a trade elasticity estimated using industry-level trade data, then

θˆ k =

∑ j,i ∑ z∈ 

k

_____________ θ z X ji,z ∑ j,i ∑ z∈ 

k

X ji,z

,

where θ z is the unobservable segment-level trade elasticity. To elaborate on Restriction R4, consider my main dataset that reports trade flows

at the level of WIOD industries. Caliendo and Parro (2014) merge the WIOD data with industry-level tariff data to estimate the trade elasticity, θˆ k , for each WIOD industry. R4 allows me to relate their estimated elasticity, θˆ k , to segment-level elasticities, θ  and θ  , for each industry. In other words, per R4, I can take the externally

32An alternative approach is to allow for reduced-form asymmetries in trade costs as in Waugh (2010). However,

as noted by Waugh (2010), such asymmetries "may be reduced-form representations of equilibrium responses to the fundamentals faced by agents." As I will argue shortly, and in-line with the quoted remark, even under R2 the present model features asymmetries in real trade costs that arise due to equilibrium responses.

estimated θˆ k and estimate a spread, ρ k ≡ θ  / θ  , to uniquely determine both θ  and θ  as follows:

(13) θ  ( ρ k ,X  ,X  ; θˆ k ) = (

____________________ ∑ j,i ∑ z=, X ji,z ∑ j,i X ji, + ρ −1 ∑ j,i X ji, ) θ

ˆ k ,

θ  ( ρ k ,X  ,X  ; θˆ k ) = (

__________________ ∑ j,i ∑ z=, X ji,z ρ ∑ j,i X ji, + ∑ j,i X ji, ) θ

ˆ k .

Importantly, unlike the trade elasticity levels, the spread ρ k ≡ θ  / θ  can be separately identified from β k .33

The Estimation Problem.—Given parametric restrictions R1–R4; data on wage and population size, w and L , geo-political variables, D , industry-level expenditure shares, α k , and revenue shares, r k = [ r j,k ] ; we can estimate the vector of parameters, Θ k = { ρ k ,e k , k ,β k } , separately for each industry to match the factual matrix of industry-level trade values, X k  = [ X  ji,k ] . The estimation problem can be formally stated as follows:



#### ∑

(lnX ji ,k − lnX ji,k ( Θ k ;α k ,w,L,D) )

2

min Θ

i≠j

j=1

k

subject to

X ji,k ( · ) = ∑

λ ji,z e i,z α i,k w i L i ∀ i,j,

z=,

λ ji,z =  j,k ( w j τ ji,k ) − θ z / ( ∑

) ∀ i,j; z = ,,



 n,k ( w n τ ni,k ) − θ z

n=1

τ ji,k = β k ⋅ β border,k ⋅ β lang,k ⋅ β agreement,k ( Dist ji ) β dist,k ∀ i,j,

θ z = θ z ( ρ k ,X  ,X  ;θ ˆ k ) , z = ,, (LMC) ∑



X ji,k = r j,k w j L j ∀ j,

i=1

where the last constraint corresponds to the equilibrium labor market-clearing condition (LMC), and θ k ( · ) is described by (13). The above estimator is a basic extension of the structural gravity estimation in Anderson and van Wincoop (2003). More specifically, the structural gravity estimation in Anderson and van Wincoop (2003) corresponds to a restricted version of the above estimator that imposes ρ k ≡ θ  / θ  = 1. Following Anderson and van Wincoop (2003), the nonlinear least square(NLLS) estimator, specified above, is unbiased ifε ≡ X  − X

33See Fieler (2011) for a thorough discussion on how the trade elasticity spread, ρ k , is separately identified from the trade cost parameters, β k .

is uncorrelated with the derivative of X ji,k ( Θ k ;α k ,w,L,D) with respect to α k , w , L , and D , which is the case if ε represents measurement errors.

As noted earlier, I run the above estimation using two different datasets. First, I estimate the model with country-level trade data for a sample of 100 countries. The country-level estimation treats each economy as one integrated industry. Second, I estimate the model separately for 15 traded WIOD industries using industry-level bilateral trade and production data for 32 major economies. The former estimation is used to study the implications of the model for international TFP differences. The latter estimation is used to quantify the gains from within-industry specialization; Appendix D provides further details about each estimation.

Finally, to conduct the above estimation, I borrow trade elasticity estimates (namely, θ ˆ k ) from the literature. For the industry-level estimation conducted on the WIOD sample, I take the elasticities from Caliendo and Parro (2014), with details provided in Table D2. For the country-level estimation, where each country is treated as one integrated industry, I set θˆ = 4 as in Anderson and van Wincoop (2003). This choice according to the meta-analysis in Head and Mayer (2014) approximates the median elasticity estimated in the existing literature.

In-Sample Fit.—Before moving forward, a brief discussion regarding the in-sample fit of the model is in order (a more thorough discussion is also provided in Appendix D). I center the discussion here around the model estimated using country-level data, noting that similar arguments apply to the industry-level estimation. The country-level estimation (where  = 100) fits the model to 100 × (100 − 1) = 9,900 data points on bilateral trade values.34 Figure 1 displays the in-sample fit of the estimated model. Table D1 (in the Appendix) also reports a summary of the main estimated parameters and compares them those attained under the standard structural gravity estimation à la Anderson and van Wincoop (2003). A notable contrast between the two estimations is that the main model exhibits an R 2 = 0.65, which is significantly higher than that of the standard structural gravity estimation.35

As discussed in Appendix D, the superior in-sample fit of the main model is due to its ability to match (i) the low volume of trade between similar trading partners ( South-South and North-North trade) as well as (ii) the high-volume of trade between dissimilar partners ( North-South trade). The standard gravity estimation, by comparison, has difficulty matching the differential levels of similar-similar and dissimilar-dissimilar trade (see Figure D1 in the Appendix). This feature of the present model is comparable to that of Fieler (2011).

34The number of estimated parameters in the country-level case is 206, which includes 100 exporter fixed effects (labeled  ), 100 country-level expenditure shares on segment  (labeled e), 6 trade cost parameters (labeled β), plus an across-segment trade elasticity spread (labeled ρ).
35Another way of enhancing the in-sample fit of the structural gravity model is to relax the symmetric trade cost assumption. In that case, trade is no longer bilaterally balanced and the exporter and importer fixed effects need not to be equal. In fact, with asymmetric trade costs, a reduced-form gravity estimation that uses a PPML estimator, becomes isomorphic to the structural gravity estimation (see Head and Mayer 2014).

25

20

()log trade valuemodel

15

10

5

0 5 10 15 20 25

log trade value (data)

Figure 1. Fit of the Model Estimated with Country-Level Data

Out-of-Sample Prediction with respect to Markups.—The initial motivation behind my theory where two facts: that export price and markup levels increase with the exporting economy's (i) income per capita, and (ii) distance to global markets. Encouragingly, the estimated model reproduces these out-of-sample facts relatively well. To demonstrate this, I suppose the underlying demand system is CES, so that markups are constant within each segment. Based on this assumption, I can easily compute the model-implied average export markup for each country j in the sample, labeling it mˆ j . After calculating the export markup levels, I run the following regression,

ln ̅ Dist ji + ε j ,

lnmˆ j = 0.07 (0.005)

lnGDPcap j + 0.32 (0.006)

which implies that the predicted export markup level increases significantly with both income per capita, GDPcap j , and remoteness, ̅ Dist ji . In comparison, the standard gravity model that does not admit low- and high-markup segments within industries, predicts a uniform markup across all exporters.

B. The Gains from Within-Industry Specialization

In this section, I use the model estimated with industry-level data to compute the gains from within-industry specialization. As shown in Appendix C, the overall gains from trade for country i can be calculated using the following formula:36



(14) G T i = 1 − ∏ k=1

(  i,k λ ii,z ) e i,z α i,k / θ z ,

∏

z∈  k

where λ ii,z denotes the share of country i 's expenditure on domestic varieties in segment z , and  i,k ≡ α i,k / r i,k [ ( ∑ z∈ 

r i,z / ε z ) ] is a term that accounts for the scale-driven gains—as before, r i,k is used to denote the share of country i 's output generated in industry k and, correspondingly, r i,z denotes the ( within-industry) fraction of output generated in segment z ∈  k . The above equation is a simple extension of the celebrated Arkolakis, Castinot, and RodríguezClare (2012—henceforth, ACR) formula. The challenge in evaluating equation (14) is that it requires data on segment-level variables, λ ii,z , r i,z , and θ z . The estimation outlined in the previous section identifies these exact variables from industry-level trade and production data as well as industry-level trade elasticity estimates (from Caliendo and Parro 2014). Since among the estimated parameter values the segment-level elasticities, θ  and θ  , are especially key to equation (14), Table D2 explicitly reports them for each of the WIOD industries.

e i,z / ε z ) / ( ∑ z∈ 

k

Equation (14) determines the overall gains from trade, which include the traditional ACR-type gains plus the gains from within-industry specialization. Hence, to determine the pure gains from within-industry specialization, I need to purge G T i from the ACR-driven gains. To pin down the ACR-driven gains, one can shut down heterogeneity within industries to eliminate within-industry specialization across low- and high- θ segments. Doing so, the gains from trade will be driven solely by traditional forces, and following Costinot and Rodríguez-Clare (2014) will be given by



α i,k / θ k ,

(  i ACR,k λ ii,k )

G T i ACR = 1 − ∏ k=1

with  i ACR,k ≡ α i,k / r i,k accounting for scale effects in the traditional ACR class of models. To evaluate the above equation, I take an identical approach to that of Costinot

and Rodríguez-Clare (2014), which is to use industry-level data on λ ii,k , r i,k , and α i,k from the WIOD, plus trade elasticity estimates, θ k , from Caliendo and Parro (2014). After computing G T i and G T i ACR , I calculate the pure gains from within-industry specialization for every country i as follows:

Gains from specialization = G T i − G T i ACR .

Table 7 reports the gains from specialization, along with the overall andACR-driven gains from trade. In summary, the main model featuring within-industry specialization

36The above formula implicitly assumes that the within-segment demand is CES and markups are constant. Nonetheless, as it will become clear, variable markups are inconsequential to the insights presented hereafter.

Table 7—The Gains from Within-Industry Specialization Gains from trade

GDP (US = 1)

Population (US = 1)

Remoteness (US = 1)

ACR (%)

Main model (%)

Gains from specialization (%)

Country

AUS 0.07 0.07 1.47 3.7 16.7 12.9
AUT 0.03 0.03 0.41 30.5 33.7 3.1 BEL 0.04 0.04 0.57 32.4 57.6 25.3 BRA 0.11 0.63 0.74 4.3 9.5 5.2 CAN 0.10 0.11 0.98 15.3 25.2 9.9 CHN 0.31 4.36 0.59 4.0 8.3 4.3 CZE 0.02 0.03 0.40 21.2 19.0 −2.2 DEU 0.26 0.27 0.53 17.6 32.7 15.1 DNK 0.02 0.02 0.52 24.8 25.1 0.3 ESP 0.11 0.15 0.54 9.5 24.3 14.8 FIN 0.02 0.02 0.47 10.5 10.3 −0.3 FRA 0.20 0.21 0.49 11.1 23.4 12.3 GBR 0.19 0.20 0.62 11.7 21.3 9.6 GRC 0.02 0.04 0.53 4.7 9.1 4.4 HUN 0.01 0.03 0.46 31.2 23.9 −7.3 IDN 0.03 0.77 0.72 4.0 6.4 2.4 IND 0.08 3.86 0.64 4.3 16.7 12.4 IRL 0.02 0.01 0.53 14.2 14.7 0.5 ITA 0.16 0.19 0.45 9.2 21.6 12.4 JPN 0.33 0.42 0.53 3.7 11.4 7.7 KOR 0.07 0.16 0.63 8.6 14.3 5.7 MEX 0.07 0.38 0.80 12.1 15.6 3.5 NLD 0.06 0.05 0.73 23.1 43.4 20.3 POL 0.04 0.13 0.40 19.7 28.0 8.3 PRT 0.02 0.03 0.48 20.6 15.6 −5.0 ROM 0.01 0.07 0.36 12.7 12.5 −0.2 RUS 0.11 0.47 0.65 0.9 19.4 18.5 SVK 0.01 0.02 0.52 23.6 16.2 −7.4 SVN 0.00 0.01 0.55 39.3 24.5 −14.8 SWE 0.03 0.03 0.47 14.5 19.4 4.9 TUR 0.05 0.23 0.51 13.3 23.8 10.6 TWN 0.03 0.08 0.98 9.9 14.6 4.7 USA 1.00 1.00 1.00 3.8 8.6 4.8 RoW 0.61 8.00 2.49 7.3 14.8 7.5

predicts gains from trade that are, on average, 46 percent larger than those implied by the ACR formula (20.4 percent versus 14 percent increase in real GDP). Based on these numbers, one can deduce that within-industry specialization alone increases the real GDP by around 6.4 percent for the average economy.

The gains from specialization are, however, quite heterogeneous across countries. High-wage or remote economies like Germany, France, and Australia gain significantly more as they specialize in high-scale-intensive segments, where returns to specialization are higher. In comparison, the pure gains from specialization are considerably smaller (and even negative) for developing economies like Mexico and Indonesia, as they specialize in low-scale-intensive segments where returns to specialization are relatively low.

C. International Specialization and TFP Differences

Beyond the gains from trade, the present model sheds new light on the profound gap in the standard of living across rich and poor countries. As noted by

Jones (2011), "by the end of the twentieth century, per capita income in the United States was more than 50 times higher than per capita income in Ethiopia and Tanzania. Dispersion across the ninety-fifth–fifth percentiles of countries was more than a factor of 32." There is an abundance of theories that seek to explain these persistent gaps.37 The general theme of such theories is to identify multiplier effects that amplify cross-country income/ TFP differences.

The theory of international specialization presented here identifies a possibly new multiplier effect. To demonstrate this, consider the special case of constant markups presented in Section IIB. Let L i,k denote the number of workers engaged in industry k ; free entry entails that country i will host a mass l i,k ≡ ( ∑ z∈ 

r i,z / ε z ) L i,k / f ek of local firms in that industry. Consequently, the real income per worker in country i will be given by (see Appendix C)38

k

(15) W i = A i ( ∏

##### )

( ∏

##### )



l α i,k

i,k e i,z / θ z

λ ii −, αz

i,k e i,z / θ z

∏

,

k=1

z∈  k



scale effects

trade openness

where A i , the fundamental TFP of country i , reflects its absolute advantage in producing high-quality output; l i,k reflects the effective scale of country i in industry k ; and θ z = γ k / ( ψ z − 1) is the segment-level trade elasticity, which is inversely related to the segment-level scale elasticity ψ z = ε z / ( ε z − 1) (as defined in Section II). Aside from the fundamental TFP, A i , the above equation states that real income depends on the scale-driven gains from specialization (first parentheses) and trade openness (second parenthesis).

Given Proposition 1, the scale-driven term operates as a TFP multiplier. To elaborate, note that all else equal, high-A countries pay higher equilibrium wages, which is easily verifiable using the balanced trade condition. Given Proposition 1, the higher wage induces high-A countries to specialize in low-ε z segments, which leads to a higher l i,k . Relatedly, high-A economies also specialize in industries that exhibit on average a higher scale elasticity. Together, these effects multiply the TFP differences, whereby the scale-driven term, ∏  k=1 ∏ z∈ 

i,k e i,z / θ z , is systematically larger for high-A i economies.

l i α,k

k

To calculate the contribution of specialization-driven scale effects to income differences, note that the scale-driven component in equation (15) can be decomposed into (i) a standard size-driven component, and (ii) a specialization-driven component as follows:

[ ( ∏

) ]

) / ( ∏

i,k e i,z / θ z = ( ∏

)



l α i,k

L i α,k

i,k / θ k

l i α,k

i,k e i,z / θ z

L α i,k

i,k / θ k

∏

,

k=1

z∈  k



size-driven

specialization-driven

37See Klenow and Rodríguez-Clare (1997); Prescott (1998); Hall and Jones (1999); Parente and Prescott (1999); Acemoglu, Johnson, and Robinson (2000); Parente and Prescott (2000); Caselli (2005); and Jones (2011).
38In the above expression A i ≡ ς ∏ k =1 ∏ z∈ k  e i,z

i,z / γ k (1− σ z ) , where ς encapsulates a combination of constant parameters. Also, the equation assumes firms are symmetric, but can be trivially extended to account for firm heterogeneity and selection effects—see Kucheryavyy, Lyn, and Rodríguez-Clare (2016).

where θ k is the industry-level trade elasticity that is itself a weighted average of the segment-level trade elasticities per R4. Note that the size-driven component is relatively well-known, as it is also present in the standard ACR class of models. But the specialization-driven component is unique to the present model, and absent (or equal to one for all countries) in the ACR class of models.

Considering the above decomposition, we can appeal to the estimated model to calculate the counterfactual real income of each country i , net of specialization-driven scale effects. Doing so using the model fitted to country-level trade data for 100 economies, implies that specialization across low- and high-scale-intensive segments explains around 37.3 percent of the cross-national heterogeneity in real income levels, as measured by var(logW i ) .39

V. Concluding Remarks

This paper presented an alternative view of within-industry specialization; the thesis of which is that, even in the absence of Ricardian or factor-driven comparative advantage, high-wage and remote economies have a tendency to specialize in high-market power segments of industries. I argued that this patterns of specialization (i) explains the cross-national and spatial variation in export price levels, and (ii) delivers macro-level implications that are distinct from traditional theories of within-industry quality specialization. Some of these macro-level implications were discussed thoroughly in this paper, but several others merit further attention.

First, the present theory can shed light on a recently circulating thesis that market power has been rising in the US economy (De Loecker and Eeckhout 2017). Many factors can be responsible for this rise. From the perspective of the present theory, this rise can reflect a trade-induced hyper-specialization of US-based firms in high-market power segments of industries. An empirical assessment of this hypothesis presents a promising avenue for future research.

Second, the present paper raises some old questions about industrial policy in an open economy. First, in the present framework, the labor allocation in high-market power segments is inefficiently low. This creates a scope for policy intervention, which is absent in the standard heterogeneous firm models of trade and geography. Additionally, the differential degrees of scale economies across segments creates a scope for industrial targeting, whereby countries can improve their terms-of-trade by heavily protecting the high-returns to scale segments of each industry.

Appendix A. Description of Data Used in Section I

The data used to calculate export unit prices is from the Comtrade-baci database compiled by the Cepii. This is the most comprehensive publicly-available data that permits the construction of export unit prices. The data originates from the United Nations Comtrade database. The baci variation of the data uses a harmonization methodology to reconcile mirror flows, which provides more complete

39These results are related to Bento (2014), who shows that accounting for heterogeneity within industries increases the elasticity of welfare with respect to entry costs.

geographical coverage than if only a single direction of Comtrade statistics were to be used—see Gaulier and Zignago (2010) for a description the Baci approach to harmonization. The Comtrade-baci database provides physical quantities (only in units of weight) and values of annual bilateral trade flows by 6-digit HS product categories from 2003 to 2015. For 224 countries Head, Mayer, and Ries (2010) report matching data for GDP, distance, and population size. The country-level production data is taken from the TradeProd database compiled by the Cepii. This dataset is built on the Trade, Production and Protection database made available by the World Bank (Nicita and Olarreaga 2007). It covers 26 industrial sectors in the ISIC Revision 2 from 1980 to 2006—see De Sousa, Mayer, and Zignago (2012) for more details.

Appendix B. General Cost Function

In the interest of exposition, the main model abstracted from intranational cost heterogeneity across firms within a segment. However, given the weak isomorphism between quality and productivity, the model can be trivially amended to also accommodate cost heterogeneity across firms. To this end, suppose firms are heterogeneous in their unit labor cost, a z (φ) , which increases with output quality

at elasticity, κ —i.e., ∂ lna z (φ) / ∂ lnφ = κ . Considering this specification, the effective marginal cost facing a firm becomes

_ ρ z

c̃ z (φ;τ ji,k ,w j ) = τ ji,k a z w j φ κ+

ρ z −1 .

Expectedly, under κ = 0, the above expression reduces to the one presented in Section II. The above marginal cost function can be reformulated in terms of an augmented degree of quality intensity, ρ ̃ z ≡ 1 − (1 − ρ z ) / (1− κ(1− ρ z )) , as follows:

_ ρ̃ z

c̃ z (φ;τ ji,k ,w j ) = τ ji,k w j φ

ρ̃ z −1 .

If 1 + κ < 1/ (1 − ρ z ) , then (i) ρ ̃ z < 1, and (ii) ρ ̃ z is strictly increasing in ρ z . That being the case, all the results presented in the paper follow qualitatively in this more

general setup, but have to be reformulated in terms of ρ ̃ z instead of ρ z . For example, the gravity equation (4) will assume the following formulation:

_________________________ ρ ̃ z ) ∑ n =1  n,z N n,k ( τ ni,k w n ) γ k (1− _ ρ 1 ̃

 j,z N j,k ( τ ji,k w j ) γ k (1−

_ 1

X ji,z =

e i,z α i,k Y i ,

z )

which immediately implies Lemma 1. That is, high-w and high- τ economies have a revealed comparative advantage in high-ρ ̃ segments, which are also the high- ρ segments, by construction. The assumption that 1 + κ < 1/ (1 − ρ z ) ensures that a higher output quality does not increase the production cost so much that it puts the high-quality firms at a disadvantage—a very similar assumption underlies Baldwin and Harrigan's (2011) quality-sorting model.

Appendix C. Proofs and Derivations A. Proof of Lemma 2

The distribution of export markups from country j to market i can be defined as a function of the bilateral trade costs, τ ji,k , and country j 's wage rate, w j , as follows:

 i,z (m;w j ,τ ji,k ) = Pr{μ(ν) ≤ m ∣ ν ≥ ν i ⁎,z } .

Note that given the definition of ν ≡ P / c̃ , the market specific cut-off ν ⁎ i,z can be expressed as

_ ρ z

( P i,z / τ ji,k w j ) ( φ ji ⁎,z )

1− ρ z = ν i ⁎,z ,

where recall that φ ⁎ ji,z denotes the lowest quality under which a firm can profitably export from country j to i . Considering (i) the above equation for ν ⁎ i,z , and (ii) that for each firm exporting from country j , ν = ν i ⁎,z [φ / φ ji ⁎,z ] ρ z /(1− ρ z ) , the markup distribution can be stated as

____________________________________ 1− ρ z ) < m,φ < φ ji ⁎,z ) Pr(φ < φ ji ⁎,z )

Pr( μ i,z ( ν i ⁎,z [φ / φ ji ⁎,z ]

_ ρ z

 i,z (m;w j ,τ ji,k ) =

.

Given Assumption A3, d G j,z (φ) = γ k  j,z φ − γ

k −1 , which simplifies the above expression as follows:

____ 1−ρ z

( ν i ⁎,z μ i −,z1 (m))

ρ z φ ⁎ ji,z d G j,z (φ)

∫ φ

− ____1− ρ zρ z γ k ,

⁎

= 1 − ( ν i,z

⁎ μ i −,z1 (m) )

 ji,z (m;w j ,τ ji,k ) =

ji,z

______________________

∫ φ ∞ ji ⁎,z d G j,z (φ)

which clearly does not depend on the characteristics of the exporting country, j . ∎ B. Proof of Proposition 1

To prove Proposition 1, it suffices to show that high-w and high-τ economies specialize in high-trade elasticity segments, where the trade elasticity is θ z = γ k (1 − 1/ ρ z ) . Once this claim is established, Proposition 1 follows immediately from Assumption A4. First, note that the segment-neutrality of technology parameters entails that  i,z =  i,k  z for z ∈  k . Plugging this condition into equation (3), the total output of country j in segment z ∈  k can be expressed as follows:

k ( __ 1

ρ z −1)  i,z

k ( __ 1

#### ρ z −1) ∑

N

τ ji γ,k

 z (  j ,w j ,τ j,k ) =  j,k N j,k w j γ

i=1

where the term  i,z ≡  z χ i,z Q i,z Y i is introduced to simplify the notation, and τ j,k = { τ ji,k } . Note that, based on the above equation, national output per

segment z ∈  k , is uniquely determined by  j,k , w j , τ j,k , where w j itself is an implicit function of  j,k and τ j,k . Correspondingly, the share of segment z ∈  k in total industry-level output can be expressed solely as a function of w j and τ j,k :

∑  i=1 τ ji −, θk

w j − θ

____________________  i,z ∑ z′∈ 

z

________ R z ∑ z′∈ 

.

r z ( w j ,τ j,k ) ≡

=

∑  i=1 τ ji −,θk

w j −θ

 i,z′

z′

R z′

z′

k

That is, the effect of  j,k on r z is entirely passed through the economy-wide wage rate. Now, consider segments  and  ∈  k that differ in their trade elasticity θ  < θ  . An α-fold increase in country j 's wage (holding τ j,k fixed) leads to an α θ

 − θ  > 1 ⇔ α > 1. In particular,

B − θ A -fold increase in r  / r  , where α θ

r ___________  (α w j ,τ j,k ) r  (α w j ,τ j,k )

r ___________  ( w j ,α τ j,k ) r  ( w j ,α τ j,k )

 − θ  r _  ( w j ,τ j,k ) r  ( w j ,τ j,k )

(C1)

= α θ

>

.

Similarly, an α-fold increase in country j 's remoteness (holding w j fixed40) leads to an α θ

 −θ -fold increase in in r  / r  , where α θ

 −θ > 1 ⇔ α > 1. In particular,

r ___________  ( w j ,α τ j,k ) r  ( w j ,α τ j,k )

 −θ r ___________  ( w j ,α τ j,k ) r  ( w j ,α τ j,k )

r ___________  ( w j ,α τ j,k ) r  ( w j ,α τ j,k )

(C2)

= α θ

>

.

Together, inequalities (C1) and (C2) imply that high-wage and remote economies specialize in low- θ segments, which per Assumption A4 are also the high-market power segments of the industry.

Welfare Formulas.—Here, I derive formulas that characterize the gains from trade

in the estimated model. To derive the formula for G T i , I follow the same steps as in Costinot and Rodríguez-Clare (2014). From the gravity equation (4), I can deduce

the following two expressions for λ ii,z and P i,z ,

λ ii,z =  i,z N i,k ( τ ii,k w j ) −θ z / ∑

 j,z N j,k ( τ ji,k w j ) −θ z ,

j

P i,z = ς i,z ( ∑

 j,z N j,k ( τ ji,k w j ) θ z )

1/θ z ,

j

where ς i,z is composed of market-specific constants that are invariant to trade costs. Combining the above two equations implies

that C i,z = w i / P i = ς i,z ( λ ii,z /  i,z N i,k ) −1/θ z ; which combined with R1 and R2 yields the following formula for aggregate welfare:

(C3) W i = ∏

k

C i α,z i,k e i,z = ∏

∏

z∈  k

k

−α i,k e i,z / θ z .

ς i,z ( λ ii,z /  i,z N i,k )

z∈  k

40Holding w j fixed entails an increase in  j to counter the increase in trade costs.

Using the familiar hat notation to denote the change in variables when moving from the factual to the counter-factual equilibrium, I can arrive at the following expression for changes in welfare:

−α i,k e i,z / θ z

( λˆ ii,z / Nˆ i,k )

Wˆ i = ∏

.

∏

z∈  k

k

To characterize Nˆ i,k in the above equation, I can appeal to the free entry condition (FE), which can be stated as follows in the constant markup case:

 _ i,z ε z ,

w i N i,k f k e = ∑

z∈  k

with  i,z = ∑ n =1 X in,z denoting the total revenue generated by economy i in segment z ∈  k . Rearranging the above equation and noting that  i = w i L i , we can arrive at the following equation describing the mass of firms,

f k e (∑  _ i,z / ε z  i,k )

 _ i,k  i

 _ i w i = ( ∑

_ r i,z ε z ) r i,k L i ,

N i,k = __ 1

z∈  k

where, by definition, r i,z ≡  i,z /  i,k and r i,k ≡  i,k /  i . The above equation immediately implies that Nˆ i,k = ( ˆ ∑ z∈ 

r i,z / ε z ) rˆ i,k , which when plugged back into the formula for Wˆ i , delivers the following:

k

/ [ ˆ ∑

− α i,k e i,z / θ z

∏ z∈  k ( λ

ε z ] rˆ i,k )



_ r i,z

Wˆ i = ∏ k=1

.

ˆ ii,z

z∈  k

Noting that in autarky (i) λ ii,z = 1; (ii) r i,k = α i,k , and (iii) ∑ z∈ 

r i,z / ε z

= ∑ z∈ 

e i,z / ε z , the above equation implies the following formula for the gains from trade,

k



G T i = 1 − 1/ Wˆ i = ∏

(  i,k λ ii,z ) α i,k e i,z / θ z ,

∏

k=1

z∈  k

e i,z / ε z ) / ( ∑ z∈ 

r i,z / ε z ) ] . Similarly, the expression for aggregate welfare (equation (15)) can be derived by appealing to equation (C3), as follows,

where  i,k ≡ α i,k / r i,k [ ( ∑ z∈ 

k



− α i,k e i,z / θ z

W i = ∏ k=1

ς i,z ( λ ii,z /  i,z N i,k )

∏

z∈  k

= ( ς i ∏

##### ) ( ∏

##### ) ,



l i α,k

i,k e i,z / θ z

 α i,z

i,k e i,z / θ z

λ ii −, αz

i,k e i,z / θ z

∏

k=1

z∈  k

r i,z / ε z ) L i,k . Replacing θ z = γ k ( ε z − 1) and noting (from Section II) that ψ z = ε z / ( ε z − 1) , we will arrive at the welfare formula specified by equation (15).

ς i,z and l i,k ≡ ( ∑ z∈ 

where ς i ≡ ∏ k ∏ z∈ 

k

Appendix D. Gravity Estimation Details

This Appendix provides details regarding the data employed and parameter values attained under the country- and industry-level estimations in Section IV.

A. Country-Level Estimation

The economy-wide estimation treats each economy as one integrated industry, i.e.,  = 1.41 So I present the results here without the industry subscript, k . To conduct the estimation, I take country-level trade data from the U.N. Comtrade database (Comtrade 2010) for a sample of the 100 largest economies (in terms of real GDP) in year 2000—this sample accounts for more than 95 percent of world trade in that year. I merge this data with (i) data on population size and GDP from the World Bank ( World Bank 2012), and data on geo-distance, common language, borders, and trade agreements from Mayer and Zignago (2011).

In the country-level implementation, I am estimating 206 structural parameters by fitting the model to (100 − 1) × 100 = 9,900 data points on bilateral trade values, namely X = [ X ji ] . The estimated parameters are the following:

(i) One hundred exporter fixed effects,  = [  j ] ;
(ii) One hundred imported fixed effects, e = [ e i, ] , that characterize the expenditure shares on segment  ;42
(iii) Five parameters, vector β , that characterize the iceberg trade costs; and
(iv) One parameter, ρ = θ / θ  , describing the spread between the trade elasticity in segments  and .

Table D1 presents the estimation results. In the interest of space, I only report the main parameters, β and ρ. The same table also compares the results to those attained under the standard structural gravity estimation with no heterogeneity in trade elasticities à la Anderson and van Wincoop (2003). Evidently, the main model exhibits a significantly higher predictive power than the standard, uniform elasticity model—the main model exhibits an R 2 = 0.65, whereas the standard model exhibits an R 2 = 0.46.

The present model's superior fit stems from its ability to match the following feature of bilateral trade data: rich-poor trading partners engage in significantly more two-way trade (i.e., have a higher X ji ) than rich-rich or poor-poor trading partners. To illustrate this point formally, let North denote the richest 20 countries in my sample and South denote the other 80 countries. Figure D1 illustrates that in year 2000 North-South trade was conducted more intensively than both North-North trade. As illustrated in Figure D1, the standard model

41This treatment of the economy resembles the approach taken by Fieler (2011). 42Once e is known, the expenditure share on segment  is automatically given by [ e i, ] = 1 − e .

Table D1—Estimation Results with Country-Level Data Parameters Standard gravity model θ  = θ  Main model θ  < θ  ρ ≡ θ _ θ 

... 0.54

(0.004) β const 2.83 4.73



(0.020) (0.065) β dist 0.43 0.36

(0.003) (0.006) β border 0.53 0.87

(0.006) (0.015) β lang 0.52 0.84

(0.005) (0.011) β agreement 0.98 0.93

(0.010) (0.020)

R 2 0.46 0.65 Observations 9,900

Note: Standard errors are reported in parentheses.

0.5

0.4

0.3

0.2

0.1

0

Model: θ < θ Data Model: θ = θ

|South−South North−South North−North| |---|

Figure D1. The Composition of Aggregate Trade Flows: Model versus Data Note: The North corresponds to the 20 richest countries in the sample.

estimated subject to θ / θ  = 1 has difficulty matching this pattern. The reason being that, under θ  / θ  = 1, the model admits no systematic pattern of intra-industry specialization that makes North-South trade relatively more attractive than North-North trade. The estimated model that treats θ / θ  < 1 as free-moving parameter, however, admits North-South specialization across low- and high- θ segments. This feature of the model renders North-South trade relatively more attractive. Importantly, the extent of North-South specialization

depends on θ  / θ  and e = [ e i, ] , which in turn guides the identification of these parameters.

B. Industry-Level Estimation

The industry-level estimation uses data from the World Input-Output Database (WIOD, Timmer et al. 2012) in 2008. The original WIOD database covers 35 industries and 40 countries, which account for more than 85 percent of world GDP, plus an aggregate of the rest of the world. The countries in the sample include all 27 members of the European Union plus 13 other major economies, namely, Australia, Brazil, Canada, China, India, Indonesia, Japan, Mexico, Russia, South Korea, Taiwan, Turkey, and the United States. The 35 industries in WIOD database are listed in Table D2, which include 15 tradable industries plus 20 service-related industries. Following Costinot and Rodríguez-Clare (2014), I contract the WIOD database into a sample of 32 major economies (listed in Table 7) plus an aggregate of the rest of the world. For each industry k , the trimmed WIOD data reports the industry-level vector of expenditure shares, α k ≡ [ α i,k ] , revenue shares, r k ≡ [ r i,k ] , the full matrix of industry-level trade flows, X k ≡ [ X ji,k ] , as well as country-level data on income, Y i = w i L i , and population size L i . As with the country-level data, I complement the WIOD with data on geo-distance, common language, borders, and trade agreements from Mayer and Zignago (2011). Following Costinot and Rodríguez-Clare (2014), the industry-level trade elasticity estimates are taken from Caliendo and Parro (2014) and are reported in Table D2.

Here, for each of the 15 ( nonservice) industries in the sample, I am separately estimating 70 structural parameters by fitting the model to (32 − 1) × 32 = 992 data points on industry-level bilateral trade values, namely X k = [ X ji,k ] .43 The estimated parameters are:

(i) Thirty-two exporter × industry fixed effects,  k = [  j,k ] ;
(ii) Thirty-two importer × industry fixed effects, e k = [ e i, ] , that characterize the expenditure shares on segment  in industry k ;44
(iii) Five parameters, vector β k , that characterize the iceberg trade costs in industry k ; and
(iv) One parameter, ρ k = θ  / θ  , describing the spread between the segment-level trade elasticities in industry k .

In all incidences, the industry-level estimation exhibits an R 2 of 0.50 or greater. The last two columns in Table D2 report the segment-level trade elasticities estimated for each of the 15 nonservice industries in the WIOD sample. For some

43So, in total, I am estimating 15 × 70 = 1,050 parameters across all industries in the sample. 44Once e k is known, the expenditure share on segment  is automatically given by [ e i, ] = 1 − e k .

Table D2—List of WIOD Industries in the Industry-Level Estimation

Industry-level θ ( Caliendo-Parro) θ θ

WIOD sector Sector's description

1 Agriculture, hunting, forestry, and fishing 8.11 6.28 9.43
2 Mining and quarrying 15.72 11.53 17.12
3 Food, beverages, and tobacco 2.55 0.89 3.71
4 Textiles and textile products; leather and footwear 5.56 3.93 7.06
5 Wood and products of wood and cork 10.83 9.10 12.00
6 Pulp, paper, printing, and publishing 9.07 6.03 9.92
7 Coke, refined petroleum, and nuclear fuel 51.08 49.38 55.10
8 Chemicals and chemical products 4.75 3.80 5.82
9 Rubber and plastics 1.66 0.62 3.35
10 Other non-metallic mineral 2.76 0.91 3.57
11 Basic metals and fabricated metal 7.99 6.55 9.03
12 Machinery, nec. 1.52 0.56 4.90
13 Electrical and optical equipment 10.60 8.78 12.48
14 Transport equipment 0.37 0.16 0.75
15 Manufacturing, nec.; recycling 5.00 3.29 6.76
16 Electricity, gas, and water supply Construction Sale, maintenance, and repair of motor vehicles and

5 ... ...

motorcycles; retail sale of fuel Wholesale trade and commission trade, except of motor vehicles and motorcycles Retail trade, except of motor vehicles and motorcycles;

repair of household goods Hotels and restaurants Inland transport Water transport Air transport Other supporting and auxiliary transport activities;

activities of travel agencies Post and telecommunications Financial intermediation Real estate activities Renting of M&Eq and other business activities Education Health and social work Public admin. and defence; compulsory social security Other community, social, and personal services Private households with employed persons

###### industries, such as "Petroleum" or "Fabricated Metals," θ  and θ  are not significantly different, whereas in the "Machinery" and "Food" industries the estimation identifies a significant degree of within-industry θ heterogeneity.

REFERENCES

Acemoglu, Daron, Simon Johnson, and James A. Robinson. 2000. "The Colonial Origins of Compara-

tive Development: An Empirical Investigation." NBER Working Paper 7771.

Alessandria, George, and Joseph P. Kaboski. 2011. "Pricing-to-Market and the Failure of Absolute

PPP." American Economic Journal: Macroeconomics 3 (1): 91–127.

Anderson, James E., and Eric van Wincoop. 2003. "Gravity with Gravitas: A Solution to the Border

Puzzle." American Economic Review 93 (1): 170–92.

Anderson, T.W., and Herman Rubin. 1949. "Estimation of the Parameters of a Single Equation in a

Complete System of Stochastic Equations." Annals of Mathematical Statistics 20 (1): 46–63.

Arkolakis, Costas, Arnaud Costinot, Dave Donaldson, and Andrés Rodríguez-Clare. 2019. "The Elu-

sive Pro-competitive Effects of Trade." Review of Economic Studies 86 (1): 46–80.

Arkolakis, Costas, Arnaud Costinot, and Andrés Rodríguez-Clare. 2012. "New Trade Models, Same

Old Gains?" American Economic Review 102 (1): 94–130.

Baldwin, Richard, and James Harrigan. 2011. "Zeros, Quality, and Space: Trade Theory and Trade

Evidence." American Economic Journal: Microeconomics 3 (2): 60–88.

Baldwin, Richard E., and Tadashi Ito. 2008. "Quality Competition versus Price Competition Goods:

An Empirical Classification." NBER Working Paper 14305.

Bento, Pedro. 2014. "Niche Firms, Mass Markets, and Income across Countries: Accounting for the

Impact of Entry Costs." Journal of Economic Dynamics and Control 48: 147–58.

Berry, Steven, James Levinsohn, and Ariel Pakes. 1995. "Automobile Prices in Market Equilibrium."

Econometrica 63 (4): 841–90.

Bertoletti, Paolo, and Federico Etro. 2017. "Monopolistic Competition When Income Matters." Eco-

nomic Journal 127 (603): 1217–43.

Bertoletti, Paolo, Federico Etro, and Ina Simonovska. 2018. "International Trade with Indirect Addi-

tivity." American Economic Journal: Microeconomics 10 (2): 1–57.

Bils, Mark, and Peter J. Klenow. 2001. "Quantifying Quality Growth." American Economic Review

91 (4): 1006–30.

Brandt, Loren, Johannes Van Biesebroeck, Luhang Wang, and Yifan Zhang. 2017. "WTO Accession and Performance of Chinese Manufacturing Firms." American Economic Review 107 (9): 2784–2820. Broda, Christian, and David E. Weinstein. 2006. "Globalization and the Gains from Variety." Quar-

terly Journal of Economics 121 (2): 541–85.

Brooks, Wyatt, and Pau Pujolas. 2016. "Nonlinear Gravity." https://www3.nd.edu/~wbrooks/

NonlinearGravity.pdf.

Caliendo, L., and F. Parro. 2014. "Estimates of the Trade and Welfare Effects of NAFTA." Review of

Economic Studies 82 (1): 1–44.

Campa, Jose, and Linda S. Goldberg. 1995. "Investment in Manufacturing, Exchange Rates and Exter-

nal Exposure." Journal of International Economics 38 (3–4): 297–320.

Caselli, Francesco. 2005. "Accounting for Cross-Country Income Differences." In Handbook of Economic Growth, Vol. 1A, edited by Philippe Aghion and Steven N. Durlauf, 679–741. Amsterdam: North-Holland.

Chaney, Thomas. 2008. "Distorted Gravity: The Intensive and Extensive Margins of International

Trade." American Economic Review 98 (4): 1707–21.

Choi, Yo Chul, David Hummels, and Chong Xiang. 2009. "Explaining Import Quality: The Role of the

Income Distribution." Journal of International Economics 78 (2): 293–303.

Coibion, Olivier, Liran Einav, and Juan Carlos Hallak. 2007. "Equilibrium Demand Elasticities across

Quality Segments." International Journal of Industrial Organization 25 (1): 13–30.

Costinot, Arnaud, and Andrés Rodríguez-Clare. 2014. "Chapter 4—Trade Theory with Numbers: Quantifying the Consequences of Globalization." In Handbook of International Economics, Vol. 4, edited by Gita Gopinath, Elhanan Helpman, and Kenneth Rogoff, 197–261. Amsterdam: North-Holland. Cragg, John G., and Stephen G. Donald. 1993. "Testing Identifiability and Specification in Instrumen-

tal Variable Models." Econometric Theory 9 (2): 222–40.

Crozet, Matthieu, Keith Head, and Thierry Mayer. 2012. "Quality Sorting and Trade: Firm-level Evi-

dence for French Wine." Review of Economic Studies 79 (2): 609–44.

de Blas, Beatriz, and Katheryn N. Russ. 2015. "Understanding Markups in the Open Economy." Amer-

ican Economic Journal: Macroeconomics 7 (2): 157–80.

De Loecker, Jan, and Jan Eeckhout. 2017. "The Rise of Market Power and the Macroeconomic Implications." NBER Working Paper 23687.
De Loecker, Jan, and Jan Eeckhout. 2018. "Global Market Power." NBER Working Paper 24768. De Loecker, Jan, Catherine Fuss, and Johannes Van Biesebroeck. 2014. "International Competition

and Firm Performance: Evidence from Belgium." National Bank of Belgium Working Paper 269.

De Loecker, Jan, Pinelopi K. Goldberg, Amit K. Khandelwal, and Nina Pavcnik. 2016. "Prices, Mark-

ups, and Trade Reform." Econometrica 84: 445–510.

De Sousa, José, Thierry Mayer, and Soledad Zignago. 2012. "Market Access in Global and Regional

Trade." Regional Science and Urban Economics 42 (6): 1037–52.

Díez, Federico J., Dabuek Leigh, and Suchanan Tambunlertchai. 2018. "Global Market Power and Its

Macroeconomic Implications." IMF Working Paper 18/137.

Dingel, Jonathan I. 2016. "The Determinants of Quality Specialization." Review of Economic Studies

84 (4): 1551–82.

Eaton, Jonathan, Marcela Eslava, C.J. Krizan, Maurice Kugler, and James Tybout. 2010. "A Search and Learning Model of Export Dynamics." http://citeseerx.ist.psu.edu/viewdoc/download?doi=10. 1.1.189.2412&rep=rep1&type=pdf.

Eaton, Jonathan, Samuel Kortum, and Francis Kramarz. 2011. "An Anatomy of International Trade:

Evidence from French Firms." Econometrica 79 (5): 1453–98.

Edmond, Chris, Virgiliu Midrigan, and Daniel Yi Xu. 2015. "Competition, Markups, and the Gains

from International Trade." American Economic Review 105 (10): 3183–3221.

Erkel-Rousse, Hélène, and Daniel Mirza. 2002. "Import Price Elasticities: Reconsidering the Evi-

dence." Canadian Journal of Economics 35 (2): 282–306.

Fajgelbaum, Pablo, Gene M. Grossman, and Elhanan Helpman. 2011. "Income Distribution, Product

Quality, and International Trade." Journal of Political Economy 119 (4): 721–65.

Fally, Thibault. 2018. "Integrability and Generalized Separability." NBER Working Paper 25025.
Fally, Thibault. 2019. "Generalized Separability and the Gains from Trade." Economics Letters 178: 8–12.

Fan, Haichao, Yao Amber Li, and Stephen R. Yeaple. 2015. "Trade Liberalization, Quality, and Export

Prices." Review of Economics and Statistics 97 (5): 1033–51.

Feenstra, Robert C., and John Romalis. 2014. "International Prices and Endogenous Quality." Quar-

terly Journal of Economics 129 (2): 477–527.

Fieler, Ana Cecília 2011. "Nonhomotheticity and Bilateral Trade: Evidence and a Quantitative Expla-

nation." Econometrica 79 (4): 1069–1101.

Flach, Lisandra. 2017. "A Note on the Isomorphism of Heterogeneous Firms Models." Economics Let-

ters 155: 24–27.

Fornell, Claes, Michael D. Johnson, Eugene W. Anderson, Jaesung Cha, and Barbara Everitt Bryant.

"The American Customer Satisfaction Index: Nature, Purpose, and Findings." Journal of Marketing 60 (4): 7–18.

Gaulier, Guillaume, and Soledad Zignago. 2010. "Baci: International Trade Database at the Prod-

uct-Level. The 1994–2007 Version." CEPII Research Center Working Paper 2010-23.

Goldberg, Pinelopi Koujianou. 1995. "Product Differentiation and Oligopoly in International Markets:

The Case of the U.S. Automobile Industry." Econometrica 63 (4): 891–951.

Grossman, Gene M., and Elhanan Helpman. 1991. "Quality Ladders in the Theory of Growth." Review

of Economic Studies 58 (1): 43–61.

Hall, Robert E., and Charles I. Jones. 1999. "Why Do Some Countries Produce So Much More Output

per Worker than Others?" Quarterly Journal of Economics 114 (1): 83–116.

Hallak, Juan Carlos. 2006. "Product Quality and the Direction of Trade." Journal of International Eco-

nomics 68 (1): 238–65.

Hallak, Juan Carlos, and Jagadeesh Sivadasan. 2013. "Product and Process Productivity: Implications for Quality Choice and Conditional Exporter Premia." Journal of International Economics 91 (1): 53–67.

Hanoch, Giora. 1975. "The Elasticity of Scale and the Shape of Average Costs." American Economic

Review 65 (3): 492–97.

Hausman, Jerry, Gregory Leonard, and J. Douglas Zona. 1994. "Competitive Analysis with Differen-

tiated Products." Annales d'Economie et de Statistique 34: 159–80.

Head, Keith, and Thierry Mayer. 2014. "Gravity Equations: Workhorse, Toolkit, and Cookbook." In Handbook of International Economics, Vol. 4, edited by Gita Gopinath, Elhanan Helpman, and Kenneth Rogoff, 131–95. Amsterdam: North-Holland.

Head, Keith, Thierry Mayer, and John Ries. 2010. "The Erosion of Colonial Trade Linkages after

Independence." Journal of International Economics 81 (1): 1–14.

Helpman, Elhanan. 1984. "Chapter 7—Increasing Returns, Imperfect Markets, and Trade Theory. In Handbook of International Economics, Vol. 1, edited by Ronald W. Jones and Peter D. Kenen, 325–

Amsterdam: North-Holland.

Hertel, Thomas, David Hummels, Maros Ivanic, and Roman Keeney. 2007. "How Confident Can We

Be of CGE-based Assessments of Free Trade Agreements?" Economic Modelling 24 (4): 611–35.

Holmes, Thomas J., Wen-Tai Hsu, and Sanghoon Lee. 2014. "Allocative Efficiency, Mark-ups, and the

Welfare Gains from Trade." Journal of International Economics 94 (2): 195–206.

Holmes, Thomas J., and John J. Stevens. 2014. "An Alternative Theory of the Plant Size Distribution, with Geography and Intra- and International Trade." Journal of Political Economy 122 (2): 369–421. Hottman, Colin J., Stephen J. Redding, and David E. Weinstein. 2016. "Quantifying the Sources of

Firm Heterogeneity." Quarterly Journal of Economics 131 (3): 1291–1364.

Hummels, David. 2001. "Toward a Geography of Trade Costs." http://www.krannert.purdue.edu/

faculty/hummelsd/research/toward/tgtc.pdf.

Hummels, David, and Peter J. Klenow. 2005. "The Variety and Quality of a Nation's Exports." Ameri-

can Economic Review 95 (3): 704–23.

Hummels, David, and Volodymyr Lugovskyy. 2009. "International Pricing in a Generalized Model of

Ideal Variety." Journal of Money, Credit and Banking 41 (S1): 3–33.

Hummels, David, and Alexandre Skiba. 2004. "Shipping the Good Apples Out? An Empirical Confir-

mation of the Alchian-Allen Conjecture." Journal of Political Economy 112 (6): 1384–1402.

Imbs, Jean, and Isabelle Mejean. 2015. "Elasticity Optimism." American Economic Journal: Macro-

economics 7 (3): 43–83.

Johnson, Robert C. 2012. "Trade and Prices with Heterogeneous Firms." Journal of International Eco-

nomics 86 (1): 43–56.

Jones, Charles I. 2011. "Intermediate Goods and Weak Links in the Theory of Economic Develop-

ment." American Economic Journal: Macroeconomics 3 (2): 1–28.

Khandelwal, Amit. 2010. "The Long and Short (of) Quality Ladders." Review of Economic Studies 77

(4): 1450–76.

Kimball, Miles S. 1995. "The Quantitative Analytics of the Basic Neomonetarist Model." NBER Work-

ing Paper 5046.

Kleibergen, Frank, and Richard Paap. 2006. "Generalized Reduced Rank Tests Using the Singular

Value Decomposition." Journal of Econometrics 133 (1): 97–126.

Klenow, Peter J., and Andrés Rodríguez-Clare. 1997. "The Neoclassical Revival in Growth Econom-

ics: Has It Gone Too Far?" NBER Macroeconomics Annual 12: 73–103.

Klenow, Peter J., and Jonathan L. Willis. 2016. "Real Rigidities and Nominal Price Changes." Eco-

nomica 83 (331): 443–72.

Kraay, Aart, and Jaume Ventura. 2007. "Comparative Advantage and the Cross-Section of Business

Cycles." Journal of the European Economic Association 5 (6): 1300–33.

Kucheryavyy, Konstantin, Gary Lyn, and Andrés Rodríguez-Clare. 2016. "Grounded by Gravity: A Well- Behaved Trade Model with Industry-Level Economies of Scale." NBER Working Paper 22484. Laffont, Jean-Jacques, and Jean Tirole. 1993. A Theory of Incentives in Procurement and Regulation.

Cambridge, MA: MIT Press.

Lashkaripour, Ahmad, and Volodymyr Lugovskyy. 2019. "Scale Economies and the Structure of Trade

and Industrial Policy." pages.iu.edu/~alashkar/Lashkaripour_Lugovskyy.pdf.

Levchenko, Andrei A., and Jing Zhang. 2014. "Ricardian Productivity Differences and the Gains from

Trade." European Economic Review 65: 45–65.

Lugovskyy, Volodymyr, and Alexandre Skiba. 2015. "How Geography Affects Quality." Journal of

Development Economics 115: 156–80.

Matsuyama, Kiminori, and Philip Ushchev. 2017. "Beyond CES: Three Alternative Cases of Flexible

Homothetic Demand Systems." Buffett Institute Global Poverty Research Lab Working Paper 17-109. Mayer, Thierry, and Soledad Zignago. 2011. "Notes on CEPII's Distances Measures: The GeoDist

Database." CEPII Working Paper 2011-25.

McCalman, Philip, and Alan Spearot. 2013. "Why Trucks Jump: Offshoring and Product Characteris-

tics." Journal of International Economics 91 (1): 82–95.

Melitz, Marc J., and Gianmarco I.P. Ottaviano. 2008. "Market Size, Trade, and Productivity." Review

of Economic Studies 75 (1): 295–316.

Melitz, Marc J., and Stephen J. Redding. 2014. "Chapter 1—Heterogeneous Firms and Trade." In Handbook of International Economics, Vol. 4, edited by Gita Gopinath, Elhanan Helpman, and Kenneth Rogoff, 1–54. Amsterdam: North-Holland.

Nicita, Alessandro, and Marcelo Olarreaga. 2007. "Trade, Production, and Protection Database, 1976–

2004." World Bank Economic Review 21 (1): 165–71.

Ossa, Ralph. 2015. "Why Trade Matters After All." Journal of International Economics 97 (2): 266–77. Ottaviano, Gianmarco, Takatoshi Tabuchi, and Jacques-François Thisse. 2002. "Agglomeration and

Trade Revisited." International Economic Review 43 (2): 409–35.

Parente, Stephen L., and Edward C. Prescott. 1999. "Monopoly Rights: A Barrier to Riches." American Economic Review 89 (5): 1216–33.
Parente, Stephen L., and Edward C. Prescott. 2000. Barriers to Riches. Cambridge, MA: MIT Press. Prescott, Edward C. 1998. "Lawrence R. Klein Lecture 1997: Needed: A Theory of Total Factor Pro-

ductivity." International Economic Review 39 (3): 525–51.

Rodrik, Dani. 1994. "Industrial Organization and Product Quality: Evidence from South Korean and Taiwanese Exports." In Empirical Studies of Strategic Trade Policy, edited by Paul Krugman and Alasdair Smith, 195–210. Chicago: University of Chicago Press.

Romalis, John. 2007. "NAFTA's and CUSFTA's Impact on International Trade." Review of Economics

and Statistics 89 (3): 416–35.

Schott, Peter K. 2004. "Across-Product versus Within-Product Specialization in International Trade."

Quarterly Journal of Economics 119 (2): 647–78.

Simonovska, Ina. 2015. "Income Differences and Prices of Tradables: Insights from an Online

Retailer." Review of Economic Studies 82 (4): 1612–56.

Simonovska, Ina, and Michael E. Waugh. 2011. "The Elasticity of Trade: Estimates and Evidence."

NBER Working Paper 16796.

Spearot, Alan C. 2013. "Variable Demand Elasticities and Tariff Liberalization." Journal of Interna-

tional Economics 89 (1): 26–41.

Stock, James H., and Motohiro Yogo. 2002. "Testing for Weak Instruments in Linear IV Regression."

NBER Working Paper 284.

Sutton, John. 2007. "Quality, Trade and the Moving Window: The Globalisation Process." Economic

Journal 117 (524): F469–98.

Sutton, John, and Daniel Trefler. 2016. "Capabilities, Wealth, and Trade." Journal of Political Econ-

omy 124 (3): 826–78.

Timmer, Marcel, Abdul A. Erumban, Reitze Gouma, Bart Los, Umed Temurshoev, Gaaitzen J. de Vries, Iñaki Arto, et al. 2012. "The World Input-Output Database (WIOD): Contents, Sources and Methods." World Input-Output Database (WIOD) Working Paper 10.

UN Comtrade Database 2010. United Nations Commodity Trade Statistics Database. http://comtrade.

un.org.

Verhoogen, Eric A. 2008. "Trade, Quality Upgrading, and Wage Inequality in the Mexican Manufac-

turing Sector." Quarterly Journal of Economics 123 (2): 489–530.

Waugh, Michael E. 2010. "International Trade and Income Differences." American Economic Review

100 (5): 2093–2124.

World Bank. 2012. World Development Indicators 2012. Washington, DC: World Bank.