Difference between revisions of "SocioPolitical"
CamrynDreyer (Talk  contribs) 

(4 intermediate revisions by 2 users not shown)  
Line 7:  Line 7:  
= <span style="fontsize:xxlarge;">Structure and Agent System: SocioPolitical</span> =  = <span style="fontsize:xxlarge;">Structure and Agent System: SocioPolitical</span> =  
−  { class="tableGrid" style="width: 100%" cellspacing="0" cellpadding="5" border="  +  { class="tableGrid" style="width:100%;" cellspacing="0" cellpadding="5" border="1" 
    
 style="width: 50%"  <div>'''System/Subsystem'''</div>   style="width: 50%"  <div>'''System/Subsystem'''</div>  
Line 93:  Line 93:  
Being electronically networked is an increasingly important aspect of human life condition. The number of networked persons (NUMNWP) is a function primarily of the growth rate in that number (NUMNWPGR). It is ultimately constrained, however, by the size of the population and by the number of connections and organizational memberships that people can have (numnwplim). The growth in networked person number slows as it approaches the ultimate limit. The model user can affect the growth pattern via a multiplier on the growth rate (numnwpgrm).  Being electronically networked is an increasingly important aspect of human life condition. The number of networked persons (NUMNWP) is a function primarily of the growth rate in that number (NUMNWPGR). It is ultimately constrained, however, by the size of the population and by the number of connections and organizational memberships that people can have (numnwplim). The growth in networked person number slows as it approaches the ultimate limit. The model user can affect the growth pattern via a multiplier on the growth rate (numnwpgrm).  
−  This approach was added to IFs during the TERRA project and draws on the thinking of Tom Tesch and Pol Descamps.[[File:  +  This approach was added to IFs during the TERRA project and draws on the thinking of Tom Tesch and Pol Descamps. 
+  
+  [[File:NetworkedPersons2.gifframecenterVisual representation of networking.]]  
== <span style="fontsize:xlarge;">Social Values and Cultural Evolution</span> ==  == <span style="fontsize:xlarge;">Social Values and Cultural Evolution</span> ==  
Line 508:  Line 510:  
*SFIMBAL (structural imbalances)  *SFIMBAL (structural imbalances)  
−  ===  +  === Input variables needed to compute the probabilities === 
{ border="1" cellspacing="1" cellpadding="1" width="0" style="width:576px;" align="center"  { border="1" cellspacing="1" cellpadding="1" width="0" style="width:576px;" align="center"  
Line 815:  Line 817:  
== Drug Model Equations ==  == Drug Model Equations ==  
−  We  +  We use linear regressions for each of the variables described above. We fit this linear equation to logistic curves to derive the final prevalence rate. The methodology used here is similar to what is used in the water and sanitation model in the International Futures tool to compute access to water and sanitation.<ref>Rothman, D.S. and Irfan M.T, IFs infrastructure model documentation, Working Paper 2013.07.22, Josef Korbel School of International Studies, University of Denver, Denver CO. https://pardee.du.edu/ifsinfrastructuremodeldocumentation</ref> 
The values are computed using the equations given below,  The values are computed using the equations given below,  
Line 852:  Line 854:  
The values for drug prevalence are initialized using illicit drug demand data from the UNODC. However, data availability from this source is low. Appendix II shows the data coverage across countries from the UNODC. Therefore, filling holes for the first year where no data is available is crucial. There are three options available to the user when filling holes. They are,  The values for drug prevalence are initialized using illicit drug demand data from the UNODC. However, data availability from this source is low. Appendix II shows the data coverage across countries from the UNODC. Therefore, filling holes for the first year where no data is available is crucial. There are three options available to the user when filling holes. They are,  
<ol style="liststyletype:loweralpha;">  <ol style="liststyletype:loweralpha;">  
−  <li>'''Using IHME equations to fill holes '''The institute for health and metric evaluation also provides data on drug prevalence and this source has much higher coverage (184 countries from 1990 to 2016). However, this data pertains to treatment of drug prevalence. We developed regression equations to estimate levels of illicit drug use from the IHME drug prevalence data set. Appendix  +  <li>'''Using IHME equations to fill holes '''The institute for health and metric evaluation also provides data on drug prevalence and this source has much higher coverage (184 countries from 1990 to 2016). However, this data pertains to treatment of drug prevalence. We developed regression equations to estimate levels of illicit drug use from the IHME drug prevalence data set. Appendix III describes these regression equations in detail. </li> 
<li>'''Using forecast year equations  '''This method uses the forecast year equations to derive the drug prevalence value for the first year of the model.</li>  <li>'''Using forecast year equations  '''This method uses the forecast year equations to derive the drug prevalence value for the first year of the model.</li>  
<li>'''Using regional averages from the UNODC '''Alternatively, we can also use regional averages for illicit drug prevalence to fill in holes for individual countries. </li>  <li>'''Using regional averages from the UNODC '''Alternatively, we can also use regional averages for illicit drug prevalence to fill in holes for individual countries. </li>  
Line 899:  Line 901:  
2.3 is the cap on drug prevalence for amphetamines. <ref>These caps have been chosen on the basis of the highest historical global prevalence rates</ref>  2.3 is the cap on drug prevalence for amphetamines. <ref>These caps have been chosen on the basis of the highest historical global prevalence rates</ref>  
−  '''AMIN''' is the function used to get the minimum value of drug prevalence and the cap (2.3) Since prevalence of drug usage tends to be slow moving over time, we have also capped the rate of growth of the prevalence rate for all four drug types. The growth rate in the drug prevalence rate is capped at 5 percent for every country for every year. However, this growth rate is not applicable when the parameters on drug prevalence rate are activated by a user.  +  '''AMIN''' is the function used to get the minimum value of drug prevalence and the cap (2.3). Since prevalence of drug usage tends to be slow moving over time, we have also capped the rate of growth of the prevalence rate for all four drug types. The growth rate in the drug prevalence rate is capped at 5 percent for every country for every year. However, this growth rate is not applicable when the parameters on drug prevalence rate are activated by a user. 
Finally, total drug use is computed as the average of the four drug types divided by a '''drugusepolyindex''' parameter which is set to 1.2. This is done to account for users who use multiple drugs.  Finally, total drug use is computed as the average of the four drug types divided by a '''drugusepolyindex''' parameter which is set to 1.2. This is done to account for users who use multiple drugs.  
Line 905:  Line 907:  
=== Adjusting Drug Use Using the Top Down Approach ===  === Adjusting Drug Use Using the Top Down Approach ===  
−  The paragraph above described the computation of drug prevalence using the bottom up approach i.e. drug prevalence is computed for each drug type individually and this is used to compute total drug demand. However, another approach to computing drug demand would be to compute total drug demand first and distribute that across drug types i.e. a top down approach. The model computes total drug demand using this top down approach and then converges the drug demand computed through the bottom up approach to the same  +  The paragraph above described the computation of drug prevalence using the bottom up approach (i.e. drug prevalence is computed for each drug type individually and this is used to compute total drug demand). However, another approach to computing drug demand would be to compute total drug demand first and distribute that across drug types (i.e. a top down approach). The model computes total drug demand using this top down approach and then converges the drug demand computed through the bottom up approach to the same 
The top down model uses youth bulge and household consumption as the two main drivers.  The top down model uses youth bulge and household consumption as the two main drivers.  
Line 915:  Line 917:  
:<math>DruguseEst_{R}=100*\frac{(e^{(z_{druguseest} )}}{1+e^{(z_{druguseest} )}}</math>  :<math>DruguseEst_{R}=100*\frac{(e^{(z_{druguseest} )}}{1+e^{(z_{druguseest} )}}</math>  
−  The total drug use from the bottom up approach is converged to the above value over a period of 100 years. Note that there is a restriction on the  +  The total drug use from the bottom up approach is converged to the above value over a period of 100 years. Note that there is a restriction on the year growth and decline rate of total drug use of 2%. 
== Violence Model Equations ==  == Violence Model Equations ==  
Line 923:  Line 925:  
In the preprocessor, each of the violence variables are initialized using death rate data from the Institute for Health and Metric Evaluation (IHME). Please note that we only forecast mortality and the model currently does not have a representation of the prevalence of violence.  In the preprocessor, each of the violence variables are initialized using death rate data from the Institute for Health and Metric Evaluation (IHME). Please note that we only forecast mortality and the model currently does not have a representation of the prevalence of violence.  
−  For the conflict deaths, instead of using the latest data point for initialization, we use a weighted average of conflict deaths from the previous 10 years which is then divided by  +  For the conflict deaths, instead of using the latest data point for initialization, we use a weighted average of conflict deaths from the previous 10 years which is then divided by two to generate a more realistic number for the initialization. 
Where no data is available for any particular type of violence, we use the forecast equations to fill in holes for the first year of the model.  Where no data is available for any particular type of violence, we use the forecast equations to fill in holes for the first year of the model.  
−  In the first year of the model, we need to make sure that the total deaths from violence matches the total deaths from intentional injuries in the health model. Hence we normalize the total violence deaths to the total intentional injuries deaths. Please note that this normalization is optional  +  In the first year of the model, we need to make sure that the total deaths from violence matches the total deaths from intentional injuries in the health model. Hence we normalize the total violence deaths to the total intentional injuries deaths. Please note that this normalization is optional (i.e. the user can activate a switch '''''svvionormsw'''''). The normalization will also be activated in the event the user turns on the forward linkage switch from the violence model to the health model '''''svtohlsw'''''. 
−  For the normalization we first calculate the total deaths from intentional injuries in the health model. This term is called the AdjustedViolenceTerm. Now, we calculate the total deaths from the violence model and call this tem SVTerm. The deaths from the violence model are now normalized to the deaths from the health model using the equations below (The below equation is used for normalizing conflict deaths. Similar equations are used for the other types of violence),  +  For the normalization we first calculate the total deaths from intentional injuries in the health model. This term is called the AdjustedViolenceTerm. Now, we calculate the total deaths from the violence model and call this tem SVTerm. The deaths from the violence model are now normalized to the deaths from the health model using the equations below. (The below equation is used for normalizing conflict deaths. Similar equations are used for the other types of violence), 
Line 943:  Line 945:  
=== Forecast Years ===  === Forecast Years ===  
−  In the forecast years  +  In the forecast years estimated values are calculated using forecast equations for each type of violence. The forecast equations have been explained in Table 1 below. Each of the types of violence are calculated using this estimated value and the respective shift factor calculated in the first year of the model and the multipliers on the death rates are applied. 
The equations used are as follows,  The equations used are as follows,  
Line 967:  Line 969:  
Where,  Where,  
−  ConflictEst, HomicideEst, WomenandChilEst, PoliceEst and SelfHarmEst are the estimated level deaths calculated using the forecast equations.  +  ConflictEst, HomicideEst, WomenandChilEst, PoliceEst and SelfHarmEst are the estimated level of deaths calculated using the forecast equations. 
ConflictShift, HomicideShift, WomenandChilShift, PoliceShift and SelfHarmShift are the shift factors calculated in the first year of the model.  ConflictShift, HomicideShift, WomenandChilShift, PoliceShift and SelfHarmShift are the shift factors calculated in the first year of the model.  
Line 1,085:  Line 1,087:  
−  +  After this, the total number of deaths are calculated for each category. For this purpose, we first calculate the total populations for adult males, women and children from the population model as '''AdultMaleTerm''', '''WomenTerm''' and '''ChildrenTerm''' respectively. Next, we calculate the total number of deaths for each of the categories and apply the additive parameters on total deaths ('''''svdthsadd''''') as follows,  
:<math>SVDTHSOTHERINTERTOT_{R}=(SVDTHSOTHERINTERPERSON_{R}/100000)*AdultMaleTerm)+svdthsadd_{R,5}</math>  :<math>SVDTHSOTHERINTERTOT_{R}=(SVDTHSOTHERINTERPERSON_{R}/100000)*AdultMaleTerm)+svdthsadd_{R,5}</math>  
Line 1,113:  Line 1,115:  
<math>+SVDTHSWOMENANDCHILTOT_{R}+SVDTHSSELFHARMTOT_{R})+ svdthsadd_{R,6}</math>  <math>+SVDTHSWOMENANDCHILTOT_{R}+SVDTHSSELFHARMTOT_{R})+ svdthsadd_{R,6}</math>  
−  +  Because we have applied additive parameters above, we perform a recalculation of the total death rates using the total number of deaths from each category of violence.  
We now calculate the total death rate from societal violence,  We now calculate the total death rate from societal violence, 
Latest revision as of 01:56, 24 September 2018
Please cite as: Hughes, Barry B., and José R. Solórzano. 2014. "IFs Governance and SocioCultural Model Documentation ." Working paper 2014.03.05.a. Pardee Center for International Futures, Josef Korbel School of International Studies, University of Denver, Denver, CO. Accessed DD Month YYYY <https://pardee.du.edu/wiki/SocioPolitical>
A substantial portion of the sociopolitical model of IFs is scattered throughout the other models. There are "policy handles" or intervention points throughout those models. For instance, in the population model, multipliers on the total fertility rate can reflect policy decisions (although they can also reflect the model user's judgment concerning social changes in the country or region, independent of policy). Patterns of regulation, subsidy, tax incidence, and provision of state services are so diffuse and complicated that we resort to looking at their aggregate consequences through various "policy handles" rather than trying to represent them explicitly.
For more information on this module, please use the links below or read more at SocioPolitical Equations Overview.
Contents
 1 Structure and Agent System: SocioPolitical
 2 Dominant Relations: Sociopolitical
 3 Sociopolitical Flow Charts
 3.1 Overview
 3.2 Social Characteristics: Life Conditions
 3.3 Physical Quality of Life (PQLI)
 3.4 Income Distribution
 3.5 Social Characteristics: Networking
 3.6 Social Values and Cultural Evolution
 3.7 Social Organization and Change
 3.8 Social Organization: Stability/State Failure
 3.9 Government Spending
 3.10 Drug Demand
 3.11 Violence
 4 Sociopolitical Equations
 4.1 Overview
 4.2 Sociopolitical Equations: Life Conditions
 4.3 Sociopolitical Equations: Income Distribution
 4.4 Social Equations Networking
 4.5 Sociopolitical Equations: Values
 4.6 Sociopolitical Equations: Structures or Institutions
 4.7 Sociopolitical Equations: Stability/State Failure
 4.8 Probability of state failure from different causes
 4.9 Economic Inequality and Political Conflict
 4.10 Drug Model Equations
 4.11 Violence Model Equations
 4.12 Policy Equations: Government Expenditures
 4.13 Policy Equations: Foreign Aid
 5 References
Structure and Agent System: SocioPolitical
System/Subsystem

Sociopolitical

Organizing Structure

Social fabric

Stocks

Levels of human wellbeing and institutional development (human and social capital) Cultural structures

Flows

Social expenditures Value change

Key Aggregate Relationships (illustrative, not comprehensive)

Growth in literacy and human development; Democratic development, state failure

Key AgentClass Behavior Relationships (illustrative, not comprehensive)

Government efforts to develop human capital through spending on health, education, R&D

Unlike the use of cohortcomponent structures in demographics and of markets and social accounting matrices for economics, there is no standard organizing structure that is widely used for representing sociopolitical systems. In the context of the TERRA project, IFs developed a multicomponent approach to structure that might be called the "social fabric" (a la Robert Pestel).
Although representation of agentclass behavior would be of special interest in a sociopolitical module, most relationships in IFs remain at the level of aggregate specifications.
Dominant Relations: Sociopolitical
Domestic SocioPolitical Change: Dominant Relations
Social and political change occurs on three dimensions (social characteristics or individual life conditions, values, sociopolitical institutions and process). Although GDP per capita is strongly correlated with all dimensions of change, it might be more appropriate to conceptualize a syndrome or complex of developmental change than to portray an economicallydriven process.^{[1]}
For causal diagram see SocioPolitical Flow Charts Overview.
For equations see, for example, SocioPolitical Equations Overview.
Key dynamics are directly linked to the dominant relations
 The model computes some key social characteristics/life conditions, including life expectancy and fertility rates in the demographic model, but the user can affect them via multipliers (mortm, tfrm). Literacy rate is an endogenous function of education spending, which the user can influence (via gdsm).
 The model computes value or cultural change on three dimensions: traditional versus secularrational, survival versus selfexpression, and modernism versus postmodernism, which the user can affect via additive factors (tradsrateadd, survseadd, matpostradd).
 Freedom, democracy (the POLITY measure), autocracy, economic freedom, and the status of women are all computed endogenously but can all be shifted by the user via multipliers (freedomm, democm, autocm, econfreem, gemm)
Domestic SocioPolitical Change: Selected Added Value
The larger sociopolitical model provides representation and control over government spending on education, health, the military, R&D, foreign aid, and a residual category. Military spending is linked to interstate politics, both as a driver of threat and as a result of actionandreaction based arms spending. The submodel provides aggregated indicators of the physical quality of life and the human development index.
Sociopolitical Flow Charts
Overview
The social and political module represents a complex of interacting structures and processes. These include:
 The various social characteristics or life conditions of individuals
 Human values, beliefs, and orientations’
 Social and political structures, informal as well as formal
 Social and political processes, both domestic and international
Cultural foundations frame all of these components. And all of the components interact closely with human demographic and economic systems.
The sociopolitical elements of IFs are among the most dynamically evolving aspects of the overall modeling system. Much, but not everything in the above figure has been fully represented yet within IFs; the figure indicates direction of development and shows implemented elements in italics.
For more, please read the links below.
Social Characteristics: Life Conditions
Individuals are the foundations of society. Many social indicators are actually aggregated indicators of their condition. The Human Development Index (HDI) is a widelyused summary measure of that life condition, based on life expectancy, educational attainment, and GDP per capita.
Physical Quality of Life (PQLI)
The Overseas Development Council (then under the leadership of Jim Grant) developed and publicized a measure of (physical) quality of life (the PQLI) many years ago. It combines literarcy rate, infant mortality rate, and life expectancy, using scales from the lowest to the highest values in the global system. It weights the three scales equally. The literacy rate is, in turn, a function of the per capita spending levels on education, estimated crosssectionally. In many respects the PQLI was a predecessor of the human development index (HDI).Based on country/regionspecific Physical Quality of Life, it is possible to compute world quality of life (WPQLI) and the NorthSouth gap in quality of life (NSPQLI). Given countryspecific literacy rates, it is also possible to compute world literacy (WLIT).
Income Distribution
Income distribution is represented by the share of national income earned by the poorest 20 percent of the population. That share is obtained from data whenever possible, but is estimated from a crosssectional relationship when necessary and changed over time by that relationship (the values tend, however, to be very stable both in the real world and in the model). Because initial conditions of variables affected by income share, such as fertility and mortality rates, already reflect existing income distributions, it is only the changes in that distribution relative to the expected value that the model uses in such relationships. A parameter (incshrm) is available to change income share and thus affect those variables influenced by it.
Social Characteristics: Networking
Being electronically networked is an increasingly important aspect of human life condition. The number of networked persons (NUMNWP) is a function primarily of the growth rate in that number (NUMNWPGR). It is ultimately constrained, however, by the size of the population and by the number of connections and organizational memberships that people can have (numnwplim). The growth in networked person number slows as it approaches the ultimate limit. The model user can affect the growth pattern via a multiplier on the growth rate (numnwpgrm).
This approach was added to IFs during the TERRA project and draws on the thinking of Tom Tesch and Pol Descamps.
Social Values and Cultural Evolution
IFs computes change in three cultural dimensions identified by the World Values Survey (Inglehart 1997). Those are dimensions of materialism/postmaterialism, survival/selfexpression, and traditional/secularrational values.
Inglehart has identified large cultural regions that have substantially different patterns on these value dimensions and IFs represents those regions, using them to compute shifts in value patterns specific to them.
Levels on the three cultural dimensions are predicted not only for the country/regional populations as a whole, but in each of 6 age cohorts. Not shown in the flow chart is the option, controlled by the parameter "wvsagesw," of computing country/region change over time in the three dimensions by functions for each cohort (value of wvsagesw = 1) or by computing change only in the first cohort and then advancting that through time (value of wvsagesw = 2).
The model uses countryspecific data from the World Values Survey project to compute a variety of parameters in the first year by cultural region (Englishspeaking, Orthodox, Islamic, etc.). The key parameters for the model user are the three country/regionspecific additive factors on each value/cultural dimension (matpostradd, etc.).
Finally, the model contains data on the size (percentage of population) of the two largest ethnic/cultural groupings. At this point these parameters have no forward linkages to other variables in the model.
Social Organization and Change
The sociopolitical module computes change in freedom (political and economic) and the status of women. For freedom it uses both the measure of the Freedom House and the combined measure for democracy (building on democracy and autocracy) of the POLITY project. It also computes a measure of economic freedom and of gender equality.Social Organization: Stability/State Failure
The State Failure project has analyzed the propensity for different types of state failures within countries, including those associated with revolution, ethnic conflict, genocidepoliticide, and abrupt regime change (using categories and data pioneered by Ted Robert Gurr. Upon the advice of Gurr, IFs groups the first three as internal war and the last as political instability.
IFs uses the same primary variables (infant mortality, democracy, and trade openness) as the State Failure project to drive forecasts of the probability of individual events of state failure, of ongoing episodes of it, and of the magnitude of episodes. In addition, it allows the use in the formulation of GDP per capita and years of education. Many other linkages have been and can be explored, including cultural regions.
Government Spending
The economic submodel provides total government spending. Government spending by category begins as a simple product of total government consumption and fractional shares by spending category.
Spending by type (military, health, education, research and development, other, and foreign aid) is largely specified exogenously, building on the initial conditions for each country/region. In addition, an actionreaction (armsrace) dynamic can be established in military spending if the actionreaction switch is turned on. After adjustments to foreign aid and military spending, spending in all categories is renormalized to equal total governmental spending.
Educational spending is further broken out of total educational spending. The user can shift the spending across three educational levels (primary, secondary, and tertiary) through the use of an educational multiplier.See also the specifications of detailed final demand and of international finance.
Drug Demand
The UNODC drug report finds that illicit drug use is concentrated amongst the youth, notably young males living in an urban environment. The UNODC report also finds a pronounced gender gap in relation to illicit drug consumption. Gender equality and empowerment seems to act as a key driver when it comes to determining drug consumption. For example, in the United States, characterized by a small gender gap, female drug use is about two thirds that of males, whereas in some other countries, including India and Indonesia, female drug use is as low as one tenth that of males, though there is a risk that female drug use may be underreported.
In addition, we have also found poverty, inequality and government health expenditure as drivers of specific types of drug prevalence. Policy options with respect to drug prevalence are represented in the model using multipliers which can be used to simulate an increase or decrease in drug prevalence. The table below lists the driving variables for each of the drug types.
Drug Type  Driving Variables  Driving Variables in IFS 
Amphetamines 
Youth Bulge, Gender Inequalities  YTHBULGE, GEM 
Cocaine  Consumption levels, Gender Empowerment Measure and Income Inequality  (C/POP), GEM, GINIDOM 
Opiates  Poverty, Youth Bulge and Urban Population  INCOMELT310LN, YTHBULGE, POPURBAN 
Prescription Opiods  Health Expenditure  HLEXPEND 
The figure below shows a diagrammatic representation of the drug demand model in IFs,
Violence
Mortality from conflict is driven using the probability of internal war (SFINTLWARALL). Mortality from homicides and violence against women and children are driven using the youthbulge (YTHBULGE) and the GINI coefficient (GINIDOM). Police violence deaths are driven by homicides(SVDTHSOTHERINTERPERSON) and the Corruption index in IFs (GOVCORRUPT). Finally, mortality from selfharm is calculated using mental health deaths (which are calculated in the health model) and deaths of women and children (SVDTHSWOMENANDCHILDREN). There are user controllable parameters available in the model to increase the death rates (svmulm) and the total number of deaths (svdthsadd) for each of the categories of violence. Finally, the homicide index(HOMICIDEINDEX) is calculated using each of the death rates mentioned above excluding selfharm. The homicide index itself is used in computing a conflict component of the security index in IFs (GOVINDSECUR).
The figure below shows a visual representation of the violence model in IFs.
Sociopolitical Equations
Overview
A substantial portion of the policy model of IFs is scattered throughout the other models. There are "policy handles" or intervention points throughout those models. For instance, in the population model, multipliers on the total fertility rate can reflect policy decisions (although they can also reflect the model user's judgment concerning social changes in the country or region, independent of policy). Similarly, in the energy model, the multiplier on energy demand can represent conservation policy. Similarly, the ultimate energy resource base and the rate of resource discovery remain uncertain in part because they are subject to a wide range of government interventions  multipliers can introduce assumptions about such interventions. In the economic module, the level of trade protection is very clearly a policy parameter as is the multiplier on the tax rate. Patterns of regulation, subsidy, tax incidence, and provision of state services are so diffuse and complicated that we resort to looking at their aggregate consequences through various "policy handles" rather than trying to represent them explicitly.
IFs contains other categories of sociolpolitical activity, however, that it represents in more integrated fashion in the sociopolitical module as a fourdimensional social fabric: social characteristics/life condition, values, social structures (formal and informal), and social processes.
For help understanding the equations see Notation.
Sociopolitical Equations: Life Conditions
Literacy changes from the initial level for the region because of a multiplier (LITM).
 $ LIT_{\gamma}=\mathbf{LIT}^{t=1}_{\gamma}*LITM_{\gamma} $
The function upon which the literacy multiplier is based represents the cross sectional relationship globally between educational expenditures per capita (EDEX) from the government submodel and literacy rate (LIT). Rather than imposing the typical literacy rate on a region (and thereby being inconsistent with initial empirical values), the literacy multiplier is the ratio of typical literacy at current expenditure levels to the normal literacy level at initial expenditure levels. This formulation predates the development of an educational module that calculates the numbers of those with a primary education (one common definition of literacy). As that module is refined, we will likely derive literacy dynamics from it.
 $ LITM=\frac{AnalFunc(EDEX)}{AnalFunc(\mathbf{EDEX}^{t=1})} $
Educational expenditures (and thus implicitly literacy and labor efficiency) are tied back to the economic model via the economic production function.
Given life expectancy, literacy, and infant mortality levels from the mortality distribution, it is possible to compute the Physical Quality of Life Index (PQLI) that the Overseas Development Council developed (ODC, 1977: 147#154). This measure averages the three quality of life indicators, first normalizing each indicator so that it ranges from zero to 100. The normaliza"tion is not needed for literacy; for life expectancy it converts the range of approximately 28 (LIFEXPMIN) to 80 (LIFEXPMAX) years into 0 to 100; for infant mortality it converts the range of approximately 229 per thousand (INFMORMAX) to 9 per thousand (INFMORMIN) into 0 to 100.
 $ PQLI_{\gamma}=\frac{LIT_{\gamma}+\frac{LIFEXP_{\gamma}\mathbf{lifexpmin}}{LifExpMax\mathbf{lifexpmin}}*100+\frac{\mathbf{infmormax}MORDST_{\gamma,c1}}{\mathbf{infmormax}InfMorMin}*100}{300} $
where
 $ LifExpMax=Max(LIFEXP^t_{\gamma}) $
 $ InfMorMin=Min(INFMOR^t_{\gamma}) $
For most users, the United Nations Development Program’s human development index (HDI) has replaced the PQLI as an integrated measure of life condition. It is a simple average of three subindices for life expectancy, education, and GDP per capita (using purchasing power parity). The life expectancy subindex is the same as was used for the PQLI. The literacy subindex is again the literacy rate. The GDP per capita index is a logged form that runs from a minimum of 100 to a maximum of $40,000 per capita. The measure in IFs differs slightly from the HDI version, because it does not put educational enrollment rates into a broader educational index with literacy; that will be changed as the educational model of IFs is better tested.
 $ HDI_{\gamma}=\frac{LifeExpInd_{\gamma}+LitInd+GDPInd}{3} $
where
 $ LifeExpInd=\frac{LIFEXP_{\gamma}LIFEXPMIN}{LIFEXPMAXLIFEXPMIN} $
 $ LitInd=LIT_{\gamma}/100 $
 $ GDPInd=\frac{Log(GDPPCP_{\gamma}*1000)Log(100)}{Log(40000)Log(100)} $
Although the HDI is a wonderful measure for looking at past and current life conditions, it has some limitations when looking at the longerterm future. Specifically, the fixed upper limits for life expectancy and GDP per capita are likely to be exceeded by many countries before the end of the 21st century. IFs has therefore introduced a floating version of the HDI, in which the maximums for those two index components are calculated from the maximum performance of any state in the system in each forecast year.
 $ HDIFLOAT_{\gamma}=\frac{LifeExpInd_{\gamma}+LitInd+GDPInd}{3} $
where
 $ LifeExpInd=\frac{LIFEXP_{\gamma}LIFEXPMIN}{HDILIFEMAXFLOATLIFEXPMIN} $
 $ LitInd=LIT_{\gamma}/100 $
 $ GDPInd=\frac{Log(GDPPCP_{\gamma}*1000)Log(100)}{Log(GDPPCPMAX)Log(100)} $
The floating measure, in turn, has some limitations because it introduces relative attainment into the equation rather than absolute attainment. IFs therefore uses still a third version of the HDI, one that allows the users to specify probable upper limits for life expectancy and GDPPC in the twentyfirst century. Those enter into a fixed calculation of which the normal HDI could be considered a special case.
 $ HDI21stFIX_{\gamma}=\frac{LifeExpInd_{\gamma}+LitInd+GDPInd}{3} $
where
 $ HDILIFEMAX21=\mathbf{hdilifemaxf} $
 $ LifeExpInd=\frac{LIFEXP_{\gamma}LIFEXPMIN}{HDILIFEMAX21LIFEXPMIN} $
 $ LitInd=LIT_{\gamma}/100 $
 $ Log(GDPPCP21)=Log(\mathbf{hdigdppcmax}*1000) $
 $ GDPInd=\frac{Log(GDPPCP_{\gamma}*1000)Log(100)}{Log(GDPPCP21)Log(100)} $
It is useful to compute several additional global indicators, a world physical quality of life index (WPQLI), a world life expectancy (WLIFE), a world literacy rate (WLIT), and a North#South gap index or ratio of quality of life in the "developed D" regions to the "less developedL" regions (NSPQLI).
 $ WPQLI=\frac{\sum^RPQLI_{\gamma}*POP_{\gamma}}{WPOP} $
 $ WLIFE=\frac{\sum^RLIFEXP_{\gamma}*POP_{\gamma}}{WPOP} $
 $ WLIT=\frac{\sum^RLIT_{\gamma}*POP_{\gamma}}{WPOP} $
 $ NSPQLI=\frac{\frac{\sum^DPQLI_{\gamma}*POP_{\gamma}}{\sum^DPOP_{\gamma}}}{\frac{\sum^LPQLI_{\gamma}*POP_{\gamma}}{\sum^LPOP_{\gamma}}} $
Sociopolitical Equations: Income Distribution
The income share of the poorest 20 percent of the population (INCSHR) depends on the GDP per capita at PPP (GDPPCP) and on an exogenous income share multiplier (incshrm).
 $ INCSHR^t_{\gamma}=INCSHR^t_{\gamma}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}*\mathbf{incshrm_{\gamma}} $
The introduction of different household types into the social accounting matrix structure of IFs made possible the computation of a more sophisticated measure of income distribution tied directly to the model’s computation of household income (HHINC) and household size (HHPOP) by type. A domestic Gini value (GINIDOM) is calculated from a function that uses the normal Lorenz curve foundation for Gini indices. Because that function can calculate values that are quite different from the empirical initial values, a ratio of the empirical value to the initial computed value (GINIDOMRI) is used for scaling purposes. The model’s formulation of the relative household income levels of different household types, and therefore the calculation of a domestic GINI based on those income levels, are in early versions and are still rather crude.
 $ GINIDOM^t_{\gamma}=GINIFUN(HHINC_{R,S},HHPOP_{R,S})*GINIDomRI^{t1}_{\gamma} $
where
 $ GINIDomRI^{t1}_{\gamma}=\frac{GINIDOM^{t1}_{\gamma}}{GINIFUN(HHINC^{{\gamma}1}_{R,S},HHPOP^{t1}_{R,S})} $
One value of a domestic Gini calculation is that it, in turn, makes possible the calculation of the percentage of population living on less than one dollar per day (INCOMELT1) or two dollars per day (INCOMELT2). Functions were estimated linking GDP per capita at purchasing power (GDPPCP) and the Gini index to those percentages. Again, IFs uses initial conditions for scaling purposes.
 $ INCOMELT1^t_{\gamma}=AnalFunc(GDPPCP_{\gamma},GINIDOM_{\gamma})*INCOMELT1RI^{t1}_{\gamma} $
where
 $ INCOMELT1RI^{t1}_{\gamma}=\frac{\mathbf{INCOMELT1}^{t1}{\gamma}}{AnalFunc(GDPPCP^{\gamma1}_{\gamma},GINIDOM^{t1}_{\gamma})} $
 $ INCOMELT2^t_{\gamma}=AnalFunc(GDPPCP_{\gamma},GINIDOM_{\gamma})*INCOMELT2RI^{t1}_{\gamma} $
where
 $ INCOMELT2RI^{t1}_{\gamma}=\frac{INCOMELT2^{t1}_{\gamma}}{AnalFunc(GDPPCP^{\gamma1}_{\gamma},GINIDOM^{t1}_{\gamma})} $
IFs also calculates a global Gini index across all countries/regions in the model, again using the standard Lorenz curve approach to areas of inequality and equality. It does not yet take into account intraregional income differentials, but the foundation is now in place to do so.
 $ GINI^t_{\gamma}=GINIFUN(GDP_R,POP_R) $
The user interface of IFs now uses the same Lorenzcurve approach to allow the user to calculate a specializeddisplay GINI for any variable that can be represented across all countries/regions of the model.
Social Equations Networking
The focal point of this portion of the model is on the computation of the total number of networked persons (NUMNWP). The rate of growth in that number (NUMNWPGR) is subject to several forces. The initial value of that rate is set in the data preprocessor of the model from empirical data. When no data are available for a country or region, the rate is set at a level determined via a crosssectional relationship between GDP per capita (PPP) and portion of population networked.
 $ NUMNWP_{\gamma}=NUMNWP^{t1}_{\gamma}*(1+NumNwGR^t_{\gamma}) $
where
 $ NumNwGR^t_{\gamma}=NUMNWPGR^{t1}_{\gamma}*(\frac{nwplmNUMNWP^{t1}_{\gamma}}{nwplmNUMNWP^{t1}_{\gamma}})^2*numnwpgrm $
 $ nwplm=numnwplim*POP_{\gamma} $
Over time the growth rate of networked persons is subject to a saturating function, as the actual number of networked persons approaches a limit. The limit is set by an exogenous multiplier (numnwplim) on total population; networked persons can exceed total population because of multiple affiliations of individuals (households, NGOs, companies). The user of the model can accelerate or deaccelerate the process of networking via a multiplier on the growth rate (numnwpgrm).
Although of interest in its own right, the number of networked persons is also carried forward in the model to the production function of the economy.
Sociopolitical Equations: Values
IFs computes change in three cultural dimensions identified by the World Values Survey (Inglehart 1997). Those are dimensions of materialism/postmaterialism (MATPOSTR), survival/selfexpression (SURVSE), and traditional/secularrational values (TRADSRAT). On each dimension the process for calculation is somewhat more complicated than for freedom or gender empowerment, however, because the dynamics for change in the cultural dimensions involves the aging of population cohorts. IFs uses the six population cohorts of the World Values Survey (1= 1824; 2=2534; 3=3544; 4=4554; 5=5564; 6=65+). It calculates change in the value orientation of the youngest cohort (c=1) from change in GDP per capita at PPP (GDPPCP), but then maintains that value orientation for the cohort and all others as they age. Analysis of different functional forms led to use of an exponential form with GDP per capita for materialism/postmaterialism and to use of logarithmic forms for the two other cultural dimensions (both of which can take on negative values).
 $ MATPOSTR_{\gamma,c1}=\mathbf{MATPOSTR}^{t1}_{\gamma,c1}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}+\mathbf{CultShMP}^t_{\gammacultural}+\mathbf{matpostradd}^t_{\gamma} $
where
 $ \mathbf{CultShMP}^t_{\gammacultural}=F(\mathbf{MATPOSTR}^{t1}_{\gamma,c1},AnalFunc(GDPPCP^{t1}_{\gamma}) $
 $ SURVSE_{\gamma,c1}=\mathbf{SURVSE}^{t1}_{\gamma,c1}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}+\mathbf{CultShSE}^t_{\gammacultural}+\mathbf{survseadd}^t_{\gamma} $
where
 $ CultShSE^t_{\gammacultural}=F(\mathbf{SURVSE}^{t1}_{\gamma,c1}, AnalFunc(GDPPCP^{t1}_{\gamma}) $
 $ TRADSRAT_{\gamma,c1}=\mathbf{TRADSRAT}^{t1}_{\gamma,c1}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}+\mathbf{CultShTS}^t_{\gammacultural}+\mathbf{tradsratadd}^t_{\gamma} $
where
 $ \mathbf{CultShTS}^t_{\gammacultural}=F(\mathbf{TRADSRAT}^{t1}_{\gamma,c1},AnalFunc(GDPPCP^{t1}_{\gamma}) $
The user can influence values on each of the cultural dimensions via two parameters. The first is a cultural shift factor (e.g. CultSHMP) that affects all of the IFs countries/regions in a given cultural region as defined by the World Value Survey. Those factors have initial values assigned to them from empirical analysis of how the regions differ on the cultural dimensions (determined by the preprocessor of raw country data in IFs), but the user can change those further, as desired. The second parameter is an additive factor specific to individual IFs countries/regions (e.g. matpostradd). The default values for the additive factors are zero.
Some users of IFs may not wish to assume that aging cohorts carry their value orientations forward in time, but rather want to compute the cultural orientation of cohorts directly from crosssectional relationships. Those relationships have been calculated for each cohort to make such an approach possible. The parameter (wvsagesw) controls the dynamics associated with the value orientation of cohorts in the model. The standard value for it is 2, which results in the "aging" of value orientations. Any other value for wvsagesw (the WVS aging switch) will result in use of the cohortspecific functions with GDP per capita.
Regardless of which approach to valuechange dynamics is used, IFs calculates the value orientation for a total region/country as a population cohortweighted average.
IFs uses an approach that is similar to the one for literacy in order to estimate the future of another measure created by the United Nations Development Program, one called the Gender Equity Measure (GEM). The closer the values of that measure approach "1", the closer women are to men in political and social power.
 $ GEM_{\gamma}=GEM^{t1}_{\gamma}*\frac{AnalFunc(GDPPC_{\gamma})}{AnalFunc(GDPPC^{t1}_{\gamma})} $
Sociopolitical Equations: Structures or Institutions
IFs endogenizes level of freedom (FREEDOM), based on the Freedom House measures, by linking change from initial conditions to GDP per capita at purchasing power parity in an analytic function. For discussion of the relationship between GDP and democracy, see Londregran and Poole (1996) and Przeworski and Limongi (1997). The latter view it as a probabilistic relationship in which there are a variety of reasons (often external pressure) at all levels of economic development for the conversion of dictatorships to democracies and in which the conversion of democracies to dictatorships occurs commonly at low but not high levels of development. That pattern creates a positive correlation between economic development and democratic government. A multiplier in freedom level (freedomm) increases or decreases the level of freedom.
 $ FREEDOM_{\gamma}=FREEDOM^{t1}_{\gamma}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(\mathbf{GDPPCP}^{t1}_{\gamma})}*\mathbf{freedomm}_{\gamma} $
The Economic Freedom Institute (with leadership from the Fraser Institute; see Gwartney and Lawson with Samida, 2000) have also introduced a measure of economic freedom. IFs represents that in similar fashion.
 $ ECONFREE_{\gamma}=ECONFREE^{t1}_{\gamma}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}*\mathbf{econfreem}_{\gamma} $
The POLITY project provides an alternative to the freedom house measure of freedom or democracy level. In fact, it provides multiple variables related to political system. IFs EARLIER included formations of two of those, democracy (DEMOC) and autocracy (AUTOC). They worked in completely analogous fashion.
 $ DEMOC_{\gamma}=DEMOC^{t1}_{\gamma}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}*\mathbf{democm}_{\gamma} $
 $ AUTOC_{\gamma}=AUTOC^{t1}_{\gamma}*\frac{AnalFunc(GDPPCP_{\gamma})}{AnalFunc(GDPPCP^{t1}_{\gamma})}*\mathbf{autocm}_{\gamma} $
More recently, IFs has (1) combined the two Polity project measures into a single one as is often done with the Polity measures, setting POLITYDEMOC equal to democracy – autocracy + 10, a measure that runs from 0 to 20; (2) introduced a more complicated, multilevel forecast for the new measure.
Specifically, the project identified three levels of analysis for factors that affect democratic change: domestic, regional, and systemic. At each of the three levels there are multiple factors that can affect democracy within states. At the domestic level we can identify two categories of factors in particular:
 GDP per capita. This variable correlates highly with almost all measures of social condition; GDP provides the resources for democratization and other social change.
 values/culture. Values clearly do differ across countries and regions of the world and almost certainly affect propensity to democratize.
At the regional level (or, more accurately, the "swingstates" level) we can also identify three prospective drivers:
 world average effects. It is possible that the world average exerts a pulleffect on states around the world (for instance, increasingly globalization could lead to homogenization of a wide variety of social structures around the world).
 swing states effects. Some states within regions quite probably affect/lead others (obviously the former Soviet Union was a prime example of such a swing state within its sphere of influence, but there is reason to believe in lesser and less coercive effects elsewhere).
 regional average. States within a region possibly affect each other more generally, such that "swing states" are moved by regional patterns and not simply movers of them.
At the system level we identify three:
 systemic leadership impetus. It is often suggested that the United States and other developed countries can affect democratization in less developed countries, either positively or negatively
 snowballing of democracy (Huntington 1991). The wave character of democratization suggests that there may be an internal dynamic, a selfreinforcing positive feedback loop, of the process globally, partially independent of other forces that act on the process. Such a conclusion is consistent with the fact that idea spread and global regime development influence many types of social change (Hughes 2001)
 miscellaneous other forces. Historic analysis would identify world war, economic depression, and other factors to explain the global pattern of democratization, especially the surge or retreat of waves.
A project document prepared for the CIA’s Strategic Assessment Group (SAG) analyzed historic data and, in cooperation with David Epstein and Larry Diamond, fit an approach to it that cut across these three levels (see Hughes 2002: 5974 for elaboration and documentation of the empirical work). The empirical work is not documented again here. The work did not find significant and consistent regional level effects, however, and the regional variables are therefore normally turned off.
The resulting formulation uses the domestic level as an initial base calculation because it is the empirically strongest piece, and later adds (optionally) the regional level effects and the systemic effects. The base calculation is further tied to the actual empirical levels in the initial year of the run, with the impact of the driving variables being felt only in change of those levels. An ‘expected" democracy level (DEMOCEXP) is computed using an analytic function that uses GDP per capita at purchasing power parity (GDPPCP) and the World Value Survey’s survival and selfexpression dimension (SURVSE). These were found quite powerful in their level of correlation with democracy and the WVS dimension, interestingly, carries a cultural component into the formulation. The user can further modify this basic formulation with an exogenous multiplier (democm).
 $ DEMOCPOLITYBase^t_{\gamma}=\mathbf{DEMOCPOLITY}^{t1}_{\gamma}*\frac{DEMOCEXP^t_{\gamma}}{DEMOCEXP^{t1}_{\gamma}}*\mathbf{democm}^t_{\gamma} $
where
 $ DEMOCEXP^t_{\gamma}=AnalFunc(GDPPCP^t_{\gamma},SURVSE^t_{\gamma}) $
It is also useful to have a separate calculation of the empirically strongest piece of the formulation, namely the domestic effects, but without any adjustment to the initial empirical values. The expected democracy variable (DEMOCEXP) carries that. It can be compared with the fully computed values to see the degree to which there may be tension in countries between democracy levels that GDP per capita and values would predict, on the one hand, and those that are in the initial data. The greatest tension levels tend to be in the Middle Eastern countries, where decmocracy is considerably below "expected" levels.
The initial conditions of democracy in countries carry a considerable amount of idiosyncratic, countryspecific influence, much of which can be expected to erode over time. Therefore a revised base level is computed that converges over time from the base component with the empirical initial condition built in to the value expected purely on the base of the analytic formulation. The user can control the rate of convergence with a parameter that specifies the years over which convergence occurs (polconv) and, in fact, basically shut off convergence by sitting the years very high.
 if $ \mathbf{sweffects}=1 $
 then $ SwingEffects^t_{\gamma}=timeadj*\mathbf{swingstsdem}_{\gammaSwinger,p1}*(WDemoc^{t1}DEMOCPOLITY^{t1}_{\gammaSwingee})+timeadj*\mathbf{swingstsdem}_{\gammaSwinger,p2}*(DEMOCPOLITY^{t1}_{\gammaSwinger}DEMOCPOLITY^{t1}_{\gammaSwingee})+timeadj*\mathbf{swingstsdem}_{\gammaSwinger,p3}*(RgDemocDEMOCPOLITY^{t1}_{\gammaSwingee}) $
where
 $ timeadj=.2 $
 $ WDemoc^{t1}=\frac{\sum^RDEMOCPOLITY^{t1}_{\gamma}}{R} $
else
 $ SwingEffects^t_{\gamma}=0 $
On top of the countryspecific calculation sits the (optional) regional or swing state effect calculation (SwingEffects), turned on by setting the swing states parameter (swseffects) to 1. The swing effects term has three components. The first is a world effect, whereby the democracy level in any given state (the "swingee") is affected by the world average level, with a parameter of impact (swingstdem) and a time adjustment (timeadj) . The second is a regionally powerful state factor, the regional "swinger" effect, with similar parameters. The third is a swing effect based on the average level of democracy in the region (RgDemoc).
David Epstein of Columbia University did extensive estimation of the parameters (the adjustment parameter on each term is 0.2). Unfortunately, the levels of significance were inconsistent across swing states and regions. Moreover, the term with the largest impact is the global term, already represented somewhat redundantly in the democracy wave effects. Hence, these swing effects are normally turned off and are available for optional use.
Also on top of the countrylevel effects sits the effect of global waves (DemGlobalEffects). Those depend on the amplitude of waves (DEMOCWAVE) relative to their initial condition and on a multiplier (EffectMul) that translates the amplitude into effects on states in the system. Because democracy and democratic wave literature often suggests that the countries in the middle of the democracy range are most susceptible to movements in the level of democracy, the analytic function enhances the affect in the middle range and dampens it at the high and low ends.
 $ DemGlobalEffect^t_{\gamma}=(DEMOCWAVE^t\mathbf{democwave^{t1}})*EffectMul_{\gamma} $
where
 $ MDemocPolity^{t1}_{\gamma}=MovingAverage(DEMOCPOLITY^{t1}_{\gamma}) $
 $ EffectMul_{\gamma}=AnalFunc(MDemocPolity^{t1}_{\gamma}) $
The democratic wave amplitude is a level that shifts over time (DemocWaveShift) with a normal maximum amplitude (democwvmax) and wave length (democwvlen), both specified exogenously, with the wave shift controlled by a endogenous parameter of wave direction that shifts with the wave length (DEMOCWVDIR). The normal wave amplitude can be affected also by impetus towards or away from democracy by a systemic leader (DemocImpLead), assumed to be the exogenously specified impetus from the United States (democimpus) compared to the normal impetus level from the U.S. (democimpusn) and the net impetus from other countries/forces (democimpoth).
 $ DEMOCWAVE^t=DEMOCWAVE^{t1}+DemocImpLead+\mathbf{democimpoth}+DemocWaveShift $
where
 $ DemocImpLead=\frac{(\mathbf{democimpusdemocimpusn)*eldemocimp}}{\mathbf{democwvlen}} $
 $ DemocWaveShift=\mathbf{\frac{democwvmax}{demowvlen}}*DEMOCWVDIR $
Given both the global and regional/swingstate effects, it is possible to add these to the basic country calculation for the final computation of the level of democracy using the Polity scale. The size of the swing effects is constrained by an external parameter (swseffmax).
 $ DEMOCPOLITY^t_{\gamma}=DEMOCPOLITYBaseRev^t_{\gamma}+SwingEffect^t_{\gamma}+DemGlobalEffects^t_{\gamma} $
Sociopolitical Equations: Stability/State Failure
The State Failure project has analyzed the propensity for different types of state failures within countries, including those associated with revolution, ethnic conflict, genocidepoliticide, and abrupt regime change (using categories and data pioneered by Ted Robert Gurr. Upon the advice of Gurr, IFs groups the first three as internal war and the last as political instability.
The extensive database of the project includes many measures of failure. IFs has variables representing three measures in each of the two categories, corresponding to the probability of the first year of a failure event (SFINSTABY1 and SFINTLWARY1), the probability of the first year or a continuing year (SFINSTABALL and SFINTLWARALL), and the magnitude of a first year or continuing event (SFINSTABMAG and SFINTLWARMAG).
Using data from the State Failure project, formulations were estimated for each variable using up to five independent variables that exist in the IFs model: democracy as measured on the Polity scale (DEMOCPOLITY), infant mortality (INFMOR) relative to the global average (WINFMOR), trade openness as indicated by exports (X) plus imports (M) as a percentage of GDP, GDP per capita at purchasing power parity (GDPPCP), and the average number of years of education of the population at least 25 years old (EDYRSAG25). The first three of these terms were used because of the state failure project findings of their importance and the last two were introduced because they were found to have very considerable predictive power with historic data.
The IFs project developed an analytic function capability for functions with multiple independent variables that allows the user to change the parameters of the function freely within the modeling system. The default values seldom draw upon more than 23 of the independent variables, because of the high correlation among many of them. Those interested in the empirical analysis should look to a project document (Hughes 2002) prepared for the CIA’s Strategic Assessment Group (SAG), or to the model for the default values.
One additional formulation issue grows out of the fact that the initial values predicted for countries or regions by the six estimated equations are almost invariably somewhat different, and sometimes quite different than the empirical rate of failure. There may well be additional variables, some perhaps countryspecific, that determine the empirical experience, and it is somewhat unfortunate to lose that information. Therefore the model computes three different forecasts of the six variables, depending on the user’s specification of a state failure history use parameter (sfusehist). If the value is 0, forecasts are based on predictive equations only. The equation below illustrates the formulation and that for the other five state failure variables varies with estimation. The analytic function obviously handles various formulations including linear and logarithmic.
 if $ \mathbf{sfusehist}=0 $ then (no history)
 $ SFINSTABALL^t_{\gamma}=PredictedTerm^t_{\gamma} $
where
 $ PredictedTerm^t_{\gamma}=ANALFUN(GDPPCP^t_{\gamma},DemocTerm^t,InfMorTerm^t,TradeTerm^t,Educ25Term^t) $
 $ DemocTerm=DemoPolity_{\gamma} $
 $ InfMorTerm=\frac{INFMOR_{\gamma}}{WINFMOR} $
 $ TradeTerm=\frac{X_{\gamma}+M_{\gamma}}{GDP}*100 $
 $ Educ25Term=EDYRSAG25_{\gamma} $
If the value of the sfusehist parameter is 1, the historical values determine the initial level for forecasting, and the predictive functions are used to change that level over time. Again the equation is illustrative.
 if $ \mathbf{sfusehist}=1 $ then (use history)
 $ SFINSTABALL^t_{\gamma}=\frac{PredictedTerm^t_f}{PredictedTerm^{t1}_f}*\mathbf{SFINSTABALL}^{t1}_{\gamma} $
where
 $ PredictedTerm=ANALFUN(GDPPCP^t_{\gamma},DemocTerm^t,InfMorTerm^t,TradeTerm^t,Educ25Term^t) $
 $ DemocTerm=DemoPolity_{\gamma} $
 $ InfMorTerm=\frac{INFMOR_{\gamma}}{WINFMOR} $
 $ TradeTerm=\frac{X_{\gamma}+M_{\gamma}}{GDP}*100 $
 $ Educ25Term=EDYRSAG25_{\gamma} $
If the value of the sfusehist parameter is 2, the historical values determine the initial level for forecasting, the predictive functions are used to change the level over time, and the forecast values converge over time to the predictive ones, gradually eliminating the influence of the countryspecific empirical base. That is, the second formulation above converges linearly towards the first over years specified by a parameter (polconv), using the CONVERGE function of IFs.
 if $ \mathbf{sfusehist}=3 $ then (converge)
 $ SFINSTABALLBase^t_{\gamma}=\frac{PredictedTerm^t_f}{PredictedTerm^{t1}_f}*\mathbf{SFINSTABALL}^{t1}_{\gamma} $
 $ SFINSTABALL^t_{\gamma}=ConvergeOverTime(SFINSTABALLBase^t_{\gamma},PredictedTerm^t_f,\mathbf{polconv}) $
where
 $ PredictedTerm=ANALFUN(GDPPCP^t_{\gamma},DemocTerm^t,InfMorTerm^t,TradeTerm^t,Educ25Term^t) $
 $ DemocTerm=DemoPolity_{\gamma} $
 $ InfMorTerm=\frac{INFMOR_{\gamma}}{WINFMOR} $
 $ TradeTerm=\frac{X_{\gamma}+M_{\gamma}}{GDP}*100 $
 $ Educ25Term=EDYRSAG25_{\gamma} $
Probability of state failure from different causes
The variables represent the probability of failure with respect to distinct conceptual groups of drivers.
 SFDEM (demography)
 SFECONDEV (economic/development)
 SFGOV (governance)
 SFIMBAL (structural imbalances)
Input variables needed to compute the probabilities
Drivers 
Coeff. 
Units 
Transformation 
Other specification 

Demography 
 
Infant mortality 
0.77919 
Deaths/1000 Births 
Ln 

population 
0.30204 
Millions 
Ln 

Population growth 
0.07767 
Percent 

Youth bulge (1529/15+) 
0.0077 
Percent 

Net migration 
0.29432 
Millions 

_cons 
8.23582 
 

Economic/Development 




GDP/cap 
0.30591 
Thousands (2011 PPP) 
Ln 

GDP/cap (log) growth 
0.06393 
Percent 

Life expectancy 
0.02537 
Years 

_cons 
2.06558 
 

Governance 




Polity 
0.03273 
10 to 10 

Polity^2 
0.02155 
Polity^{2} 

_cons 
2.89726 
 

Structural Imbalances 




polity v GDP/cap 
0.04735 
[Polity  Expected] 
Ln(GDP/cap) 
Pooled 
Life Exp. v GDP/cap 
0.0558 
[Life Exp.  Expected] 
Ln(GDP/cap) 
Partial Pool (re) 
Youth Bulge v Polity 
0.0131 
[Yth Blg %  Expected] 
Based on year 2013  
_cons 
4.23404 
 
Formulation for the probabilities is below, where β0 is the constant, β1…k are the parameters listed above, and X1…k are the driver values
Economic Inequality and Political Conflict
IFs does not yet include this important relationship. See Lichbach (1989) and Moore, Lindstrom, and O’Regan (1996) for analyses of how difficult this relationship is to specify. One critical problem is conceptualization of political conflict, political repression, political instability, political violence, political protest, etc. There are clearly many interacting, but separate dimensions for consideration. As Lichbach (1989: 448) says, "robust EIPC laws have not been discovered."
Drug Model Equations
We use linear regressions for each of the variables described above. We fit this linear equation to logistic curves to derive the final prevalence rate. The methodology used here is similar to what is used in the water and sanitation model in the International Futures tool to compute access to water and sanitation.^{[2]}
The values are computed using the equations given below,
 $ DRUGUSECOCAINE_{R}=(0.040239 * \frac{C_{R}}{POP_{R}}) + (1.966652 * GEM_{R}) + (0.476489* GINIDOM_{R})  8.7474 $
 $ DRUGUSEAMPHETAMINE_{R}=(3.522315 * YTHBULGE_{R}) + (2.495262* GEM_{R})7.801985 $
 $ DRUGUSEOPIATES_{R}=(.1.946209* LN(100 * \frac{INCOMELT310LN _{R}}{ POP_{R}}) + $
 $ (4.236404* YTHBULGE_{R}) + (.7277734 * LN(100 * \frac{POPURBAN _{R}}{ POP_{R}})  8.601204 $
 $ DRUGUSEPRESCRIPTOPIOID_{R}=(.2469778 * 100 * \frac{HLEXPEND_{R}} {GDP_{R}})7.063833 $
Where,
 C is the amount of household consumption in billion USD
 POP is the population
 YTHBULGE is the youth bulge (Population aged between 1529 years as a percent of the total population)
 INCOMELT310LN is the number of people living in poverty (earning less than USD 3.10 per day.
 POPURBAN is the number of people living in urban areas.
 HLEXPEND is the amount of health spending (private and public)
 GDP is the gross domestic product
PreProcessor and first year
The values for drug prevalence are initialized using illicit drug demand data from the UNODC. However, data availability from this source is low. Appendix II shows the data coverage across countries from the UNODC. Therefore, filling holes for the first year where no data is available is crucial. There are three options available to the user when filling holes. They are,
 Using IHME equations to fill holes The institute for health and metric evaluation also provides data on drug prevalence and this source has much higher coverage (184 countries from 1990 to 2016). However, this data pertains to treatment of drug prevalence. We developed regression equations to estimate levels of illicit drug use from the IHME drug prevalence data set. Appendix III describes these regression equations in detail.
 Using forecast year equations  This method uses the forecast year equations to derive the drug prevalence value for the first year of the model.
 Using regional averages from the UNODC Alternatively, we can also use regional averages for illicit drug prevalence to fill in holes for individual countries.
The user can choose the initialization method using the parameter druginitsw. By default the model will choose the first option i.e. using IHME equations to fill in holes for the first year of the model.
Forecast Years
Computing Drug Demand Using the Bottom Up Approach
In the forecast years, logistic regressions are used to first estimate the drug prevalence rates. The equations for amphetamines are shown below,
 $ z_{amphetamines}=(3.522315 * YTHBULGE_{R} )+ (2.495262* GEM_{R} )7.801985 $
 $ z_{cocaine}=(.040239 * \frac{C_{R}}{POP_{R}} + (1.96421* GEM_{R} )+(.0476489* GINIDOM_{(R)})8.7474 $
 $ z_{opiates}=(.7277734 * LN(POPURBAN_{R} ))+ (.42364* YTHBULGE_{R} )+(.1946*LN(INCOMELT190LN_{R} ))7.801985 $
 $ z_{presopioid}=(.2469778 * HLEXPEND_{R})7.06 $
This value is then used to compute the prevalence rate for each of the four drug types as follows,
 $ DRUGUSEAMPHETAMINE_{R}=100*\frac{e^{(z_{amphetamines} )}}{1+e^{(z_{amphetamines}) }} $
 $ DRUGUSECOCAINE_{R}=100*\frac{e^{(z_{cocaine} )}}{1+e^{(z_{cocaine}) }} $
 $ DRUGUSEOPIATES_{R}=100*\frac{e^{(z_{opiates} )}}{1+e^{(z_{opiates}) }} $
 $ DRUGUSEPRESCRIPTOPIOID_{R}=100*\frac{e^{(z_{presopioid} )}}{1+e^{(z_{presopioid}) }} $
The above values are then adjusted for the shift factor, multipliers and a cap on the maximum possible value
 $ DRUGUSEAMPHETAMINE_{R}=AMIN(DRUGUSEAMPHETAMINE_{R}+DrugShift_{R} ),2.3)*druguseamphetaminem_{R} $
 $ DRUGUSECOCAINE_{R}=AMIN(DRUGUSECOCAINE_{R}+DrugShift_{R} ),2.3)*drugusecocainem_{R} $
 $ DRUGUSEOPIATES_{R}=AMIN(DRUGUSEOPIATES_{R}+DrugShift_{R} ),2.3)*druguseopiatesm_{R} $
 $ DRUGUSEPRESCRIPTOPIOID_{R}=AMIN(DRUGUSEPRESCRIPTOPIOID_{R}+DrugShift_{R} ),2.3)*druguseprescriptopioidm_{R} $
Where,
DrugShift is the shift factor computed in the first year of the model which is used to chain the forecast values to the historical values from the data
2.3 is the cap on drug prevalence for amphetamines. ^{[3]}
AMIN is the function used to get the minimum value of drug prevalence and the cap (2.3). Since prevalence of drug usage tends to be slow moving over time, we have also capped the rate of growth of the prevalence rate for all four drug types. The growth rate in the drug prevalence rate is capped at 5 percent for every country for every year. However, this growth rate is not applicable when the parameters on drug prevalence rate are activated by a user.
Finally, total drug use is computed as the average of the four drug types divided by a drugusepolyindex parameter which is set to 1.2. This is done to account for users who use multiple drugs.
Adjusting Drug Use Using the Top Down Approach
The paragraph above described the computation of drug prevalence using the bottom up approach (i.e. drug prevalence is computed for each drug type individually and this is used to compute total drug demand). However, another approach to computing drug demand would be to compute total drug demand first and distribute that across drug types (i.e. a top down approach). The model computes total drug demand using this top down approach and then converges the drug demand computed through the bottom up approach to the same
The top down model uses youth bulge and household consumption as the two main drivers.
Total drug demand is calculated as,
 $ z_{druguseest}=(1.245 * YTHBULGE_{R} )+ (.508* \frac{C_{R}}{POP_{R}})3.498 $
 $ DruguseEst_{R}=100*\frac{(e^{(z_{druguseest} )}}{1+e^{(z_{druguseest} )}} $
The total drug use from the bottom up approach is converged to the above value over a period of 100 years. Note that there is a restriction on the year growth and decline rate of total drug use of 2%.
Violence Model Equations
Preprocessor and first year
In the preprocessor, each of the violence variables are initialized using death rate data from the Institute for Health and Metric Evaluation (IHME). Please note that we only forecast mortality and the model currently does not have a representation of the prevalence of violence.
For the conflict deaths, instead of using the latest data point for initialization, we use a weighted average of conflict deaths from the previous 10 years which is then divided by two to generate a more realistic number for the initialization.
Where no data is available for any particular type of violence, we use the forecast equations to fill in holes for the first year of the model.
In the first year of the model, we need to make sure that the total deaths from violence matches the total deaths from intentional injuries in the health model. Hence we normalize the total violence deaths to the total intentional injuries deaths. Please note that this normalization is optional (i.e. the user can activate a switch svvionormsw). The normalization will also be activated in the event the user turns on the forward linkage switch from the violence model to the health model svtohlsw.
For the normalization we first calculate the total deaths from intentional injuries in the health model. This term is called the AdjustedViolenceTerm. Now, we calculate the total deaths from the violence model and call this tem SVTerm. The deaths from the violence model are now normalized to the deaths from the health model using the equations below. (The below equation is used for normalizing conflict deaths. Similar equations are used for the other types of violence),
 $ SVDTHSCONFLICT_{R}=((AdjustedViolenceTerm_{R}*(SVDTHSCONFLICT_{R}*POP_{R}/SVTerm_{R})/POP_{R})*100000 $
Where,
POP is the total population
Shift factors are then calculated in the first year to chain the forecast values to the historical data.
Forecast Years
In the forecast years estimated values are calculated using forecast equations for each type of violence. The forecast equations have been explained in Table 1 below. Each of the types of violence are calculated using this estimated value and the respective shift factor calculated in the first year of the model and the multipliers on the death rates are applied.
The equations used are as follows,
 $ SVDTHSCONFLICT_{R}=((ConflictEst)_{R}+ConflictShift_{R})*svmulm_{R,2} $
 $ SVDTHSOTHERINTERPERSON_{R}=(HomicideEst_{R}+HomicideShift_{R})*svmulm_{R,5} $
 $ SVDTHSWOMENCHILDREN_{R}=(WomenandChilEst_{R}+WomenandChilShift_{R})*svmulm_{R,4} $
 $ SVDTHSPOLICS_{R}=(PoliceEst_{R}+ PoliceShift_{R})*svmulm_{R,3} $
 $ SVDTHSSELFHARM_{R}=(SelfHarmEst_{R}+ SelfHarmShift_{R})*svmulm_{R,1} $
Where,
ConflictEst, HomicideEst, WomenandChilEst, PoliceEst and SelfHarmEst are the estimated level of deaths calculated using the forecast equations.
ConflictShift, HomicideShift, WomenandChilShift, PoliceShift and SelfHarmShift are the shift factors calculated in the first year of the model.
No 
Function 
RSquared 
Independent variable 
Coefficient 
Constant 
1 
Conflict deaths computation 
0.5885 
Internal War magnitude 
.5501 
.0991 
2 
Police violence deaths computation 
0.1447 
Log of homicides 
.25879 
3.3145 
Police violence deaths computation 
0.1447 
Log of corruption 
0.28308 
3.3145  
3 
Interpersonal Violence Deaths computation 
0.21 
Youthbulge 
1.04344 
10.5462 
Interpersonal Violence Deaths computation 
0.21 
GINI 
2.4341 
10.5462 
After this, the total number of deaths are calculated for each category. For this purpose, we first calculate the total populations for adult males, women and children from the population model as AdultMaleTerm, WomenTerm and ChildrenTerm respectively. Next, we calculate the total number of deaths for each of the categories and apply the additive parameters on total deaths (svdthsadd) as follows,
 $ SVDTHSOTHERINTERTOT_{R}=(SVDTHSOTHERINTERPERSON_{R}/100000)*AdultMaleTerm)+svdthsadd_{R,5} $
 $ SVDTHSPOLICSTOT_{R}=(SVDTHSPOLICS_{R}/100000)*POP_{R})+svdthsadd_{R,3} $
 $ SVDTHSWOMENANDCHILTOT_{R}=(SVDTHSWOMENANDCHILDREN_{R}/100000)*(WomenTerm_{R}+ChildrenTerm_{R}))+svdthsadd_{R,3} $
 $ SVDTHSCONFLICTTOT_{R}=(SVDTHSCONFLICT_{R}/100000)*POP_{R})+svdthsadd_{R,2} $
 $ SVDTHSSELFHARMTOT_{R}=(SVDTHSSELFHARM_{R}/100000)*POP_{R})+svdthsadd_{R,1} $
After this stage, we calculate the total deaths from societal violence as a simple sum of each of the above categories,
 $ SVDTHSSOCIETALVIOLENCETOT_{R}=(SVDTHSCONFLICTTOT_{R}+SVDTHSOTHERINTERTOT_{R}+SVDTHSPOLICSTOT_{R} $
$ +SVDTHSWOMENANDCHILTOT_{R}+SVDTHSSELFHARMTOT_{R})+ svdthsadd_{R,6} $
Because we have applied additive parameters above, we perform a recalculation of the total death rates using the total number of deaths from each category of violence.
We now calculate the total death rate from societal violence,
 $ SOCIETALVIOLENCEDEATHS_{R}=(SVDTHSSOCIETALVIOLENCETOT_{R}/POP_{R} )*100000 $
Finally, the homicide index is calculated using each of the above except selfharm. The contribution of each term to the homicide index can be changed using the parameter svindexm. Each term is set to a value of 1 in the Base Case.
Policy Equations: Government Expenditures
The fiscal model of IFs is quite simple and builds on the computation of government consumption (GOVCON) in the economic model.
IFs expenditures fall into six categories: military, health, education, research and development, other, and foreign aid. IFs divides total government consumption (GOVCON) into these five destination sectors (GDS) with a vector of government spending coefficients (GK) based on initial conditions. The user can change that default pattern of government spending over time with a multiplier parameter (gdsm). The model normalizes the allocation to assure that the money spent is no more or less than total government consumption.
The last category of spending complicates the allocation of spending to destination categories. It is traditional not to think of foreign aid in terms of its percentage of the governmental budget (as we often think of defense or educational expenditures), but to think of it in terms of a percentage of the GDP. For instance, the United Nations has called for foreign aid spending equal to 0.7% (earlier 1.0%) of GDP of donor countries. Moreover, for some governments, foreign aid is not an expenditure, but a receipt and an addition to government revenues.
Therefore IFs actually calculates foreign aid expenditures and receipts first and fixes those amounts (see the foreign aid equations). It then allocates the amount of government spending that remains in the coffers of aid donors (or the augmented amount available to aid recipients) among the other categories, normalizing the allocation to the sum of the coefficients in those other categories.
 $ GDS^t_{\gamma,g}=GOVCON_{\gamma}*GK^{t1}_{\gamma,g}*\mathbf{gdsm}_{\gamma,g} $
where
 $ GK^{t1}_{\gamma,g}=\frac{\mathbf{GDS}^{t1}_{\gamma,g}}{\mathbf{GOVCON}^{t1}_{\gamma}} $
There are several forward linkages of government spending that are important. A mortality multiplier (MORTMG) is computed for the demographic model, using changes in health spending from the initial year and a parameter of the impact of that spending (elashc).
 $ MORTMG_{\gamma}=1(\frac{GDS_{\gamma,g=health}}{GDP_{\gamma}}\frac{\mathbf{GDS}^{t=1}_{\gamma,g=health}}{\mathbf{GDP}^{t=1}_{\gamma}})*\mathbf{elashc} $
Three of the forward linkages carry information on spending to the calculation of multifactor productivity in the economic production function, for additive rather than multiplicative use. One variable tracks change in education spending (CNGEDUC), modified by an elasticity of education on MFP (elmfped) and carries it forward. Another tracks changes in health spending (CNGHLTH) using a parameter (elmfphl). The third tracks changes in R&D spending with a parameter of impact (elmfprd). In each case there is a lag involved because of computational sequence.
 $ CNGEDUC^{t1}_{\gamma}=(\frac{GDS_{\gamma,g=educ}}{GDP_{\gamma}}\frac{\mathbf{GDS}^{t=1}_{\gamma,g=educ}}{\mathbf{GDP}^{t=1}_{\gamma}})*\mathbf{elmfped} $
 $ CNGHLTH^{t1}_{\gamma}=(\frac{GDS_{\gamma,g=health}}{GDP_{\gamma}}\frac{\mathbf{GDS}^{t=1}_{\gamma,g=health}}{\mathbf{GDP}^{t=1}_{\gamma}})*\mathbf{elmfphl} $
 $ CNGRAND^{t1}_{\gamma}=(\frac{GDS_{\gamma,g=R\&D}}{GDP_{\gamma}}\frac{\mathbf{GDS}^{t=1}_{\gamma,g=R\&D}}{\mathbf{GDP}^{t=1}_{\gamma}})*\mathbf{elmfprd} $
Because essentially of an older variable form for the education term that is still used in the agricultural model’s production function, the first of the three terms is transferred to that older variable (LEFMG).
 $ LEFMG^{t1}_{\gamma}=CNGEDUC^{t1}_{\gamma} $
Policy Equations: Foreign Aid
IFs uses a "pool" approach to aid (AID) rather than indicating bilateral flows from particular donors to particular recipients. That is, all aid from all donors flows into the pool and then all recipients draw proportions of the pool.
IFs uses the aid value parameter (AIDDON) to calculate the aid (AID) from donors and AIDREC to calculate the targeted aid to recipients. The pool of aid donations determines the actual total level of interstate aid flows, however, and is allocated among potential recipients according to the proportions targeted for each.
 $ AID_{\gamma}=\frac{GDP*(\mathbf{aidrec}_{\gamma}\mathbf{aiddon}_{\gamma})}{100} $
Aid outflows are negative and the total aid pool given (AIDP) is the sum of the negative flows, while the total desired aid of recipients (AIDR) is the sum of positive flows.
 $ AIDP=\sum^RAID_{\gamma} $ if $ AID_{\gamma}<1 $
 $ AIDR=\sum^RAID_{\gamma} $ if $ AID_{\gamma}>1 $
A recomputation of aid for recipients distributes the aid pool across their demands.
 $ AID=AIDP*\frac{AID_{\gamma}}{AIDR} $ if $ AID_{\gamma}>1 $
References
 ↑ here is the first reference
 ↑ Rothman, D.S. and Irfan M.T, IFs infrastructure model documentation, Working Paper 2013.07.22, Josef Korbel School of International Studies, University of Denver, Denver CO. https://pardee.du.edu/ifsinfrastructuremodeldocumentation
 ↑ These caps have been chosen on the basis of the highest historical global prevalence rates