likelihood inference of nonlinear models based on a class of

9
Research Article Likelihood Inference of Nonlinear Models Based on a Class of Flexible Skewed Distributions Xuedong Chen, 1 Qianying Zeng, 2 and Qiankun Song 2 1 School of Science, Huzhou Teachers’ College, Huzhou 313000, China 2 School of Management, Chongqing Jiaotong University, Chongqing 400074, China Correspondence should be addressed to Qiankun Song; [email protected] Received 28 August 2014; Accepted 28 September 2014; Published 3 December 2014 Academic Editor: Jinde Cao Copyright © 2014 Xuedong Chen et al. is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. is paper deals with the issue of the likelihood inference for nonlinear models with a flexible skew-t-normal (FSTN) distribution, which is proposed within a general framework of flexible skew-symmetric (FSS) distributions by combining with skew-t-normal (STN) distribution. In comparison with the common skewed distributions such as skew normal (SN), and skew-t (ST) as well as scale mixtures of skew normal (SMSN), the FSTN distribution can accommodate more flexibility and robustness in the presence of skewed, heavy-tailed, especially multimodal outcomes. However, for this distribution, a usual approach of maximum likelihood estimates based on EM algorithm becomes unavailable and an alternative way is to return to the original Newton-Raphson type method. In order to improve the estimation as well as the way for confidence estimation and hypothesis test for the parameters of interest, a modified Newton-Raphson iterative algorithm is presented in this paper, based on profile likelihood for nonlinear regression models with FSTN distribution, and, then, the confidence interval and hypothesis test are also developed. Furthermore, a real example and simulation are conducted to demonstrate the usefulness and the superiority of our approach. 1. Introduction e common assumption of distribution for random error is normal in statistical modeling. is assumption may lack the robustness against departures from normality and/or outliers and may result in misleading inferential results [1, 2]. For the past few years, there is an increasing interest in developing more flexible parametric families capable of adopting as closely as possible real data, which exhibit quite substantial nonnormal characteristics such as skewness and heavy tails. In a variety of applications, one popular option is to modify a symmetric probability density function of a variable, thereby introducing skewness. An important advantage of this sort of approach compared with other approaches to robustness is an explicit statement of the probabilistic setting, leading to a clear interpretation of the results [3]. Following this idea, the skw-normal (SN) distribution was firstly introduced by [4], and, then, the skew-t (ST) distribution was introduced by [5]; the skew-t-normal (STN) was introduced by [6]; moreover, some extensions to these multivariate cases were studied by [7, 8] and so on. Since then, several authors have tried to extend these results to more general forms of skew-symmetric distributions, of which here we would like to mention [9], in this paper; they proposed a general framework of distributions which is called flexible skew- symmetric (FSS) distribution. As pointed out by [10] that this distribution family enjoys a sufficient flexibility in that with different choice of submodel settings, the FSS distribution includes several known distributions such as the SN and ST as its special cases. However, in many practical applications, it is not rare at all to encounter a multimodality, sometimes with an even irregular shape, and, for this case, all the distributions mentioned above appear to be unsufficient to describe the multimodal feature of the data. A solution to this problem is to use finite mixture models. In [11], the authors worked with a mixture model with component densities belonging to the STN distribution and a computationally feasible EM- type algorithm was developed for calculating the maximum likelihood (ML) estimates of parameters. Unfortunately, although the proposed methodology is useful for analyzing multimodal asymmetric data, it suffers from the problem of Hindawi Publishing Corporation Abstract and Applied Analysis Volume 2014, Article ID 542985, 8 pages http://dx.doi.org/10.1155/2014/542985

Upload: ngodung

Post on 03-Feb-2017

226 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Likelihood Inference of Nonlinear Models Based on a Class of

Research ArticleLikelihood Inference of Nonlinear Models Based on a Class ofFlexible Skewed Distributions

Xuedong Chen1 Qianying Zeng2 and Qiankun Song2

1 School of Science Huzhou Teachersrsquo College Huzhou 313000 China2 School of Management Chongqing Jiaotong University Chongqing 400074 China

Correspondence should be addressed to Qiankun Song qiankunsong163com

Received 28 August 2014 Accepted 28 September 2014 Published 3 December 2014

Academic Editor Jinde Cao

Copyright copy 2014 Xuedong Chen et alThis is an open access article distributed under the Creative Commons Attribution Licensewhich permits unrestricted use distribution and reproduction in any medium provided the original work is properly cited

This paper deals with the issue of the likelihood inference for nonlinear models with a flexible skew-t-normal (FSTN) distributionwhich is proposed within a general framework of flexible skew-symmetric (FSS) distributions by combining with skew-t-normal(STN) distribution In comparison with the common skewed distributions such as skew normal (SN) and skew-t (ST) as well asscale mixtures of skew normal (SMSN) the FSTN distribution can accommodate more flexibility and robustness in the presenceof skewed heavy-tailed especially multimodal outcomes However for this distribution a usual approach of maximum likelihoodestimates based on EM algorithm becomes unavailable and an alternative way is to return to the original Newton-Raphson typemethod In order to improve the estimation as well as the way for confidence estimation and hypothesis test for the parametersof interest a modified Newton-Raphson iterative algorithm is presented in this paper based on profile likelihood for nonlinearregression models with FSTN distribution and then the confidence interval and hypothesis test are also developed Furthermorea real example and simulation are conducted to demonstrate the usefulness and the superiority of our approach

1 Introduction

The common assumption of distribution for random error isnormal in statistical modelingThis assumption may lack therobustness against departures from normality andor outliersand may result in misleading inferential results [1 2] For thepast few years there is an increasing interest in developingmore flexible parametric families capable of adopting asclosely as possible real data which exhibit quite substantialnonnormal characteristics such as skewness and heavy tailsIn a variety of applications one popular option is to modify asymmetric probability density function of a variable therebyintroducing skewness An important advantage of this sortof approach compared with other approaches to robustnessis an explicit statement of the probabilistic setting leading toa clear interpretation of the results [3] Following this ideathe skw-normal (SN) distribution was firstly introduced by[4] and then the skew-t (ST) distribution was introducedby [5] the skew-t-normal (STN) was introduced by [6]moreover some extensions to these multivariate cases werestudied by [7 8] and so on Since then several authors

have tried to extend these results to more general formsof skew-symmetric distributions of which here we wouldlike to mention [9] in this paper they proposed a generalframework of distributions which is called flexible skew-symmetric (FSS) distribution As pointed out by [10] that thisdistribution family enjoys a sufficient flexibility in that withdifferent choice of submodel settings the FSS distributionincludes several known distributions such as the SN and STas its special cases

However in many practical applications it is not rareat all to encounter a multimodality sometimes with aneven irregular shape and for this case all the distributionsmentioned above appear to be unsufficient to describe themultimodal feature of the data A solution to this problemis to use finite mixture models In [11] the authors workedwith a mixture model with component densities belongingto the STN distribution and a computationally feasible EM-type algorithm was developed for calculating the maximumlikelihood (ML) estimates of parameters Unfortunatelyalthough the proposed methodology is useful for analyzingmultimodal asymmetric data it suffers from the problem of

Hindawi Publishing CorporationAbstract and Applied AnalysisVolume 2014 Article ID 542985 8 pageshttpdxdoiorg1011552014542985

2 Abstract and Applied Analysis

ldquomodel identificationrdquo as the number of the parameters tobe estimated is usually large As a result in this paper wedeal with a new extension of the class of FSS distributionswhich is referred to as flexible skew-t-normal (FSTN) distri-bution This new distribution is proposed within the generalframework of the FSS distributions in combination with thedefinition of STN distribution In practical applications it isable to regulate the density in a more flexible way to offerrobustness and it can be treated as an appealing option foraccommodating data with skewness and heavy tails as well asmultimodality jointly

On the other hand nonlinear regression models arewidely applied in the fields of economics engineeringbiomedical research and so forth where the nonlinearfunction of unknown parameters is used to explain orinvestigate the nonlinear relationship of random phenomenaunder study More recently several authors have used aclass of skewed distributions in the context of nonlinearregression models and some valuable results were obtainedFor example [12] developed the robust estimation and thelocal influence analysis for regression model with SMSNdistribution From Bayesian point of view [13] consideredthe Bayesian estimation and the case influence diagnosticsfor nonlinear regression models with SMSN distributionsMore related literature could be found in [14ndash17] Generallyspeaking for model fitting of the nonlinear regression withskewed distributions a popular approach is to considerthe hierarchical representation of variables with a specificdistribution inwhich the postulated distribution is expressedas several conditional distributions of simpler forms suchas normal and Studentrsquos 119905 and Gamma Based on that EMalgorithm or Bayesian hierarchical approach then can beimplemented effectively for conducting model estimationand statistical inference

In this paper our aim is to develop an approach to like-lihood inference of nonlinear regression models with FSTNassumption As there is no stochastic representation for FSTNdistribution all the methods cited above become unavailablefor our considered problem and an alternativeway is to returnto the original Newton-Raphson iterative procedure formodel estimation Under the nonlinear regression paradigmthe accuracy of estimates is affected by the strength ofnonlinearity and the corresponding confidence interval andhypothesis test require the assumption of normality of theestimators or distribution which is too restrictive Besidesconsidering that in many practical applications rather thanthe total parameters we are usually interested in a propersubset of them By taking all these factors into accountin this paper we focus on the parameters of interest andpropose a modified Newton-Raphson iterative algorithm forcalculating the ML estimates based on profile likelihoodFurthermore the confidence interval and hypothesis test forthe parameters of interest are also considered We conductan application and a simulation study to compare the algo-rithm effectiveness and distribution robustness for nonlinearregression model in terms of fitting performance and modelselection The results from the numerical examples illustratethe usefulness and the superiority of our methodology

The remainder of this paper is organized as followsIn Section 2 we briefly discuss the FSS distribution andFSTN distributions In Section 3 we present the likelihoodinference including the quantities of the first- and the second-order derivatives as well as the standard Newton-Raphsoniterative formula In Section 4 we give an introductionof profile inference for our proposed model where theconfidence estimation and hypothesis test are presented tooSection 5 gives numerical examples using both simulatedand real data to illustrate the performance of the proposedmethodology Finally some concluding remarks are given inSection 6

2 Models and Notation

The class of skewed distributions such as SN ST and STNperform to be plausible for modeling skewness or (and)heavy tails underlying the observations The actual situationis that it is not rare at all to encounter multimodalitysometimes with an even more irregular shape and for thiscase the aforementioned distributions become unsufficientIn this paper with the adoption of a sufficiently flexibleclass of distributions we consider one of these extensionsreferred to as the family of flexible skew-symmetric (FSS)distributions which is introduced by [9] with the followingdensity function of type

119891 (119910 120583 120596120572) = 2120596minus1

1198910

(119911) 119866 119875119870

(119911) (1)

where 1198910

and 119866 are symmetric univariate density and dis-tribution function respectively that is 119891

0

(119909) = 1198910

(minus119909)119866(minus119909) = 1minus119866(119909) and 119875

119870

(119909) = 1205721

119909+1205723

1199093

+ sdot sdot sdot 1205722119870minus1

1199092119870minus1 is

an odd polynomial of degree 119870 (ie a polynomial includingonly terms of odd degree) 119911 = 120596

minus1

(119909 minus 120583) 120583 isin 119877 is thelocation parameter 120596 gt 0 is the scale parameter and 120572 =

(1205721

1205723

1205722119870minus1

) and1205721

1205723

1205722119870minus1

isin 119877 are shape param-eters

In general the density function of STN distribution canbe represented as 2120596

minus1

119905(119911 V)Φ(120572119911) where 119905 and Φ respec-tively denote the univariate standard Studentrsquos 119905 densityfunction and the univariate standard normal distributionfunction and V is the degrees of freedom The skewness isregulated by the shape parameter 120572 and the tail thickness ofthe distribution is controlled by V Commonly in comparisonwith the SN distribution the STN distribution exhibitsobvious feature of heavy tails when V le 10

In this paper we work with one version of (1) and thespecific definition can be presented as follows Let 119891

0

(119911) =

119905(119911 V) and 119866(119909) = Φ(119909) where Φ(119909) is defined as before and

119905 (119911 V) =Γ (V + 1) 2

120596 (V120587)12

Γ (V2)

(1 +1199112

V)

minus((V+1)119889)

(2)

that is the density function of univariate Studentrsquos 119905 distribu-tion with 0 location 120596 scale and V degrees of freedom Theabove extension of (1) is referred to as flexible skew-t-normal(FSTN) distribution denoted by FSTN(120583 120596120572 V)

It is noted that the FSTN distribution is proposed withinthe general framework of FSS distribution by combining with

Abstract and Applied Analysis 3

the definition of STN distribution and as a consequence itshares analogous feature with these two distributions Forall of that the FSTN distribution presents some interestingand peculiar features and is able to regulate the density ina more flexible way To be particular except for modelingskewness and tail thickness the FSTN distribution allows formultimodality depending on the specific setting of 120572 For thepurpose of comparison and illustration we assume 119891

0

(119911) =

120601(119911) and 119866(119911) = Φ(119911) in FSS distribution which is denotedby FSN and then we set 119870 = 2 in 119875

119870

(119911) that is 119875119870

(119909) =

1205721

119909 + 1205723

1199093 in STN distribution and for this case different

selections of 1205721

and 1205723

determine whether the density isunimodal or bimodal Moreover the same assumption for 119866

and 119875 in FSTN distribution is madeFigure 1 displays the density functions of FSN and STN

as well as FSTN distributions with four different situationsconsidered namely 120572

1

= 1 1205723

= 0 and V = 10 1205721

= 1205723

= 1

and V = 6 1205721

= 1 1205723

= minus1 and V = 4 1205721

= minus11205723

= 1 and V = 4 respectively with 120583 = 1 120596 = 15 for allcases By examination of Figure 1 we can detect how thesethree densities change with different combinations of 120572 andV For instance in Figure 1(a) FSN STN and FSTN appearto be very close while STN and FSTN are heavy tailed to alittle extent In Figure 1(b) both FSN and FSTN are unimodalwhen 120572

1

and 1205723

keep the same sign and the ranking for thedegree of skewness is STN FSTN and FSN in turn that isthe STN and FSTN distributions have thicker tails comparedto FSN distribution With opposite sign of 120572

1

and 1205723

bothFSN and FSTN distributions are bimodal and highly skewedin Figures 1(c) and 1(d) moreover in the same directionFSTN has thicker tails than FSN distribution Our proposedFSTN distribution can be treated as a proper compromisebetween the FSS distribution and the STN distribution Itallows for a wider range of tail behavior compared to FSSdistributionwhereas it is able to accommodatemultimodalitywhich cannot be described by STN distribution From theapplied viewpoint the FSTN distribution is an appealingoption which can be expected to yield robust inferentialresults in the presence of outlying observations

3 Likelihood Inference

Consider 119899 independent observations satisfying a nonlinearregression model as [119910

119894

] sim FSTN(120583119894

120596120572 V) with 120583119894

= 120578(x119894

120573) for 119894 = 1 2 119899 Here x

119894

is a 119901-dimensional vector and 120573is a 119901 times 1 vector of parameters Also let X = (x

1

x119899

)119879 be

the 119899 times 119901 design matrix 120578(sdot sdot) is a known twice differentiablefunction Then the corresponding log-likelihood for param-eter 120579 = (120573120572 120596 V) is given by

119897 (120579) = constant minus 119899 log120596 +

119899

sum

119894=1

log 119905 (119911119894

) +

119899

sum

119894=1

logΦ 119875119870

(119911119894

)

(3)And we have

120597119897 (120579)

120597120573= minus

1

120596

119899

sum

119894=1

(1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

))

120597120578119894

120597120573

120597119897 (120579)

120597120596= minus

1

120596

119899

sum

119894=1

(1199051015840

(119911119894

)

119905 (119911119894

)119911119894

+ 119911119894

1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)+ 119899)

120597119897 (120579)

120597V=

119899

sum

119894=1

1199112

119894

119905 (119911119894

)+

119899

2(119863119866 (

V + 1

2) minus 119863119866 (

V2

) minus1

2)

120597119897 (120579)

1205971205722119895minus1

=

119899

sum

119894=1

1199112119895minus1

119894

120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)1198751015840

119870

(119911119894

) 119895 = 1 2 119870

(4)

where 120578119894

= 120578(x119894

120573) and 119863119866(119909) = (119889[log Γ(119909)])119889119909 The cor-responding second-order derivatives of (3) can be shown as

1205972

119897 (120579)

120597120573120597120573119879

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

minus1

120596[

1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

1205972

119897 (120579)

120597120573120597120596=

119899

sum

119894=1

1

1205962

[1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

+119911119894

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

1205972

119897 (120579)

120597120573120597V= minus

1

120596

119899

sum

119894=1

[

(1 + Vminus1) 1199113

119894

minus 119905 (119911119894

) 1199112

119894

V21199052 (119911119894

)]

120597120578119894

120597120573

1205972

119897 (120579)

1205971205731205971205722119895minus1

=

119899

sum

119894=1

1199112119895minus1

119894

[1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573 119895 = 1 2 119870

1205972

119897 (120579)

1205971205962

=1

1205962

119899

sum

119894=1

[1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)] 3119911119894

+ 1199112

119894

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

119897 (120579)

120597120596120597V= minus

1

120596

119899

sum

119894=1

[

(1 + Vminus1) 1199113

119894

minus 119905 (119911119894

) 1199112

119894

V21199052 (119911119894

)]

1205972

119897 (120579)

1205971205961205971205722119895minus1

= minus

119899

sum

119894=1

[1

120596(1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)) 119911119894

]

times 1199112119895minus1

119894

119895 = 1 2 119870

4 Abstract and Applied Analysis

000

005

010

015

020

025

030

x

Density

FSNSTNFSTN

minus2 0 2 4 6

(a)

x

FSNSTNFSTN

000

005

010

015

020

025

030

035

Density

minus2 0 2 4 6

(b)

x

FSNSTNFSTN

minus2 0

000

005

010

015

020

025

030

Density

2 4 6

(c)

x

FSNSTNFSTN

000

005

010

015

020

025

030

Density

minus2 0 2 4 6

(d)

Figure 1 ((a)ndash(d)) Density functions of FSN STN and FSTN distributions with four different situations

1205972

119897 (120579)

120597V2=

119899

2[119879119866 (

V + 1

2) minus 119879119866 (

V2

)] +

119899

sum

119894=1

log (119905 (119911119894

))

1205972

119897 (120579)

1205971205722119895minus1

1205971205722119896minus1

=

119899

sum

119894=1

119911119895+119896

119894

119876 119875119870

(119911119894

) 119895 119896 = 1 2 119870

1205972

119897 (120579)

120597V1205971205722119895minus1

= 0 119895 = 1 2 119870

(5)

where 119879119866(119909) = (1198892

[log Γ(119909)])1198891199092 ℎ(119909) = (119905

10158401015840

(119909)119905(119909) minus

[1199051015840

(119909)]2

)1199052

(119909) and 119876(119909) = (120601(119909)Φ(119909))[119909 + (120601(119909)Φ(119909))]Assume119880(120579) = (119880

119879

(120573) 119880(120596) 119880(V) 119880119879(120572))119879 is the gradi-ent or score vector and 119867(120579) is the Hessian matrix composedof the above second-order derivatives To obtain the MLestimate of 120579 the Newton-Raphson iteration algorithm isdefined by

(119896+1)

= (119896)

minus [119867(120579)]

minus1

120579=

120579

(119896) [119880(120579)]120579=

120579

(119896) (6)

Abstract and Applied Analysis 5

It is noted that the above iterative procedure is an unpar-titioned algorithm that is all the parameters includingnonlinear regression coefficients 120573 scale parameter 120596 andshape parameter 120572 as well as tail thickness parameter V areestimated simultaneously For our considered problem thereare at least two difficulties that may be encountered for (6)the first one is that once the number of the parameters to beestimated becomes large the corresponding computationalburden turns to be heavy with an unacceptable estimationerror and the second one is as follows when the strength ofnonlinearity of the link function 120578(sdot sdot) changes the iterativeprocessmay becomeunstable or even nonconvergent leadingto the poor estimation results Considering the needs ofpractical problems rather than the total parameter set weare usually interested in a proper subset of it To improvethe efficiency of the algorithm and to facilitate statisticalinference of the nonlinearmodels with FSTNdistribution weput forward the following profile likelihoodmethod based on(3) and (6)

4 Profile Likelihood Inference

41 Profile Estimation Algorithm Let 120579 = (120573119879

120574119879

) be a parti-tion of 120579 where 120573 is a parameter vector of nonlinear regres-sion an interest parameter and 120574 is a 119902-dimension (119902 = 119870+

2) nuisance parameter with 120574 = (120572 120596 V)119879 Similarly thepartition of 119867 and 119880 is given as 119867(120579) = (

11986711 11986712

11986721 11986722) where

11986711

= (1205972

119897(120579)120597120573120597120573119879

) 11986712

= (1205972

119897(120579)120597120573120597120574119879

) = (1205972

119897(120579)

120597120573120597120596 1205972

119897(120579)120597120573120597V 1205972119897(120579)1205971205731205971205721

1205972

119897(120579)1205971205731205971205722119896minus1

) 11986712

= [11986721

]119879 and 119867

22

= (1205972

119897(120579) 120597120574120597120574119879

) and the diagonalelements of 119867

22

are given by 1205972

119897(120579) 1205971205962

1205972

119897(120579)120597V21205972

119897(120579)1205971205722

1

1205972

119897(120579)1205971205722

2119896minus1

respectively and the off-diagonal elements in 119867

22

can be obtained similarly Let119880(120579) = (119880

1

1198802

) corresponding to the partition of 120579In the subsequent context we focus on the estimation of120579 based on profile likelihood method [18] Firstly suppose 120573is known and we rewrite the original likelihood function (3)as

119897 (120579) = 119897 (120573 120574) = 119897120573

(120574) (7)

where notation 119897120573

(120574) denotes that 120573 is fixed but 120574 varies Foreach 120573 to estimate 120574 we can obtain

120573

= argmax120574

119897120573

(120574) (8)

Alternatively to estimate 120573 we evaluate the maximum valueof 119897120573

(120574) over 120573

and have

119901

= argmax120573

119897120573

(120573

) = argmax120573

119897 (120573 120573

) (9)

where 119897(120573 120573

) and 119901

are referred to as the profile likelihoodfunction and the profile ML estimation respectively

Following [19] we define the profile Newton-Raphsoniteration formula as follows

(119896+1)

119901

= (119896)

119901

minus [119867120573|120574

]minus1

(119880120573|120574

)

(119896+1)

119901

= (119896)

119901

minus ((11986722

)minus1

1198801

+ (11986722

)minus1

11986721

[119867120573|120574

]minus1

(119880120573|120574

))

(10)

where 119880120573|120574

= 1198801

minus 11986712

(11986722

)minus1

1198802

119867120573|120574

= 11986711

minus

11986712

(11986722

)minus1

11986721

and all the matrices and the vectors on theright hand side of (10) are evaluated at

(119896)

119901

and (119896)119901

Note that both in (6) and in (10) the strength of

nonlinearity of link function 120578(sdot sdot) is reflected by 1205972

120578119894

120597120573120597120573119879

to large extents Therefore by examination of the expressionof 1205972

119897(120579)120597120573120597120573119879 we find that when the element of

100381610038161003816100381610038161003816100381610038161003816

120596 [1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

100381610038161003816100381610038161003816100381610038161003816

(11)

is much less than the corresponding element of1003816100381610038161003816100381610038161003816100381610038161003816

120597120578119879

119894

120597120573[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573

100381610038161003816100381610038161003816100381610038161003816

(12)

the following approximating result can be obtained as

119867lowast

11

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

(13)

And then the iteration formulas of 119880lowast

120573|120574

[119867lowast

120573|120574

]minus1 119901

and

lowast

119901

can also be obtained just as before The above estimationprocedure is referred to as profile modified Newton-Raphsoniteration algorithm

The stopping rule for the above algorithm can bepresented as that iteration proceeds until some distanceinvolving two successive evaluations of the profile likeli-hood 119897(120573

120573

) such as |119897((119896+1)

120573

(119896+1)) minus 119897((119896)

120573

(119896))| or

|119897((119896+1)

120573

(119896+1))119897((119896)

120573

(119896))minus1| is small enough for example120576 lt 10

minus4 is adopted in this paperThe asymptotic covariancematrix of theML estimates for

profile likelihood can be evaluated by inverting the expectedinformation matrix however it does not have a closedform expression the observed information matrix 119869(120573) =

minus(1205972

119897(120573 120573

)120597120573120597120573119879

) can be used as a replacement which isestimated by 119867

11

minus 11986712

(11986722

)minus1

11986721

where 11986711

11986712

and 11986722

can be obtained as similar as aboveThe choice of the initial values plays an important role in

nonlinear regression fitting in this paper the specific steps

6 Abstract and Applied Analysis

for choosing the starting values are implemented as follows

(i) compute the initial value (0)

based on the nonlinearregression model with standard normal assumption

(ii) with (0)

fixed compute the initial values 120596(0) and

120572(0) for the SN and finite mixture SN assumptions

respectively

In order to simplify the estimation of parameter V wehave fixed integral values for V from 2 to 40 by one choose thevalue of V that maximizes the profile likelihood as V(0) andthen the initial values of

(0)

120596(0) 120572(0) V(0) that are requiredin the estimation procedure are all obtained

42 Profile Confidence Estimation and Hypothesis Test Con-fidence interval and hypothesis test play an important role instatistical inference and in the subsequent content we willconsider the profile confidence estimation andhypothesis testfor the parameters of interest in nonlinear regressionmodelsSuppose 119869(120579) = 119864[minus119867(120579)] the following regular conditionsfor likelihood inference are assumed

(R-i) int(120597 log 119871(x 120579)120597120579) sdot 119871(x 120579)119889x = 0

(R-ii) (120597120597120579) int 119871(x 120579) = int(120597119871(x 120579)120597120579)119889x = 0

(R-iii) (120597120597120579) int 119892(x)119871(x 120579)119889x = int 119892(x)(120597119871(x 120579)120597120579)119889xwhere 119892(x) is arbitrary measurable function

(R-iv) 119864(120597 log 119871(x 120579)120597120579)2

gt 0

Apart from the above assumptions in this paper someadditional conditions are assumed to hold for that

(A-i) 120578(x119894

120573) is twice continuously differentiable withrespect to 120573 isin 119877

119901(A-ii) 119863 = (120597120578

1

120597120573 120597120578119899

120597120573)119879 is full rank for all 120573

(A-iii) assume 1205790

is the true value of 120579 then there exist aneighbour region 119873

0

of 1205790

and a constant 119888 such that120582min(119869(120579)) gt 119888 for any 120579 isin 119873

0

Under the above assumption following [20] we have

2 119897 (119901

120573119901) minus 119897 (120573

0

1205730

) 997888rarr 1205942

(119901) (14)

where 1205730

is the true value of 120573 and the convergence is underthe meaning of convergence in probability Based on theprofile likelihood theory the confidence region of 120573 with theconfidence level 100(1 minus 120572) is given by

119862 (119884) = 120573 2 [119897 (119901

120573119901) minus 119897 (120573

120573

)] le 1205942

(119901 120572) (15)

where 1205942

(119901 120572) denotes the upper 120572 percentile of chi-squaredistribution with 119901 degrees of freedom Furthermore thehypothesis test

1198670

120573 = 1205730

against 1198671

120573 = 1205730

(16)

can also be considered and the corresponding test statisticsare presented by 2[119897(

120573

) minus 119897(1205730

1205730

)] Unlike standard

likelihood method profile likelihood confidence intervalsand hypothesis test do not need assumption of normality ofthe estimator or distribution which is too restrictive they arebased on an asymptotic chi-square distribution of the log pro-file likelihood ratio test statistics and these properties bring alot of convenience and feasibility in practical computation

5 Simulation Study

To investigate the experimental performance of our method-ology we undertake a simulation study to compare the fittingperformance of misspecified distribution as well as under-standing the large sample properties of the ML estimates Torealize this purpose we generate the artificial data from thefollowing two nonlinear models

Model (1) drug responsiveness model

119910119894

= 1205730

minus1205730

1 + (119909119894

1205731

)1205732

+ 120576119894

(17)

Model (2) nonlinear growth-curve model

119910119894

=1205731

1 + 1205732

exp (minus1205733

119909)+ 120576119894

(18)

with two kinds of skewed distributions for random error asfollows Case (I) 120576 sim STN(120583 120596 120572 V) and Case (II) 120576 sim

FSTN(120583 120596120572 V)The true parameter population is chosen as follows 120573 =

(2 5 minus08) is assumed to be the same for the above twomodels In Model (1) let 120583 = 120575

1

120596 = 15 120572 = 15 and V = 4

and in Model (2) let 120583 = 1205752

120596 = 15 120572 = (1 minus08) and V = 4Note that for the values of the degrees of freedom V a relativevalue (V = 4) can yield a heavy-tailed distribution as we need

In this simulation there are totally four simulated datasets corresponding to two nonlinear models Model (1) andModel (2) along with two distributions Case (I) and Case(II) for random error Similar to previous analysis eachsimulated data set is fitted under STN and FSN as well asFSTN scenarios using three different estimation algorithmsTo shed light on the experimental performance of ourmethodology an interesting comparison can be made byexamining how often we can recognize the true model

Table 1 shows the absolute value of the average biasbetween the true and the estimated parameters and thepercentages of each model being ranked as the best modelbased on AIC criterion out of 500 replications are alsopresented in Table 1

By examination of Table 1 we can find the following(i) the difference for the estimation results between drugresponsiveness model and growth-curve model is significantindicating that the nonlinearity of model imposes an impacton parameter estimation (ii) when the true distribution forrandom error is STN the three fitting distributions haveroughly the same behavior it is hard to tell which approachfor parameter estimation is good and which approach isbad and the similar results can be obtained for modelselection based on AIC (iii) when the true distribution forrandom error is FSTN the FSN and the FSTN distributions

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 2: Likelihood Inference of Nonlinear Models Based on a Class of

2 Abstract and Applied Analysis

ldquomodel identificationrdquo as the number of the parameters tobe estimated is usually large As a result in this paper wedeal with a new extension of the class of FSS distributionswhich is referred to as flexible skew-t-normal (FSTN) distri-bution This new distribution is proposed within the generalframework of the FSS distributions in combination with thedefinition of STN distribution In practical applications it isable to regulate the density in a more flexible way to offerrobustness and it can be treated as an appealing option foraccommodating data with skewness and heavy tails as well asmultimodality jointly

On the other hand nonlinear regression models arewidely applied in the fields of economics engineeringbiomedical research and so forth where the nonlinearfunction of unknown parameters is used to explain orinvestigate the nonlinear relationship of random phenomenaunder study More recently several authors have used aclass of skewed distributions in the context of nonlinearregression models and some valuable results were obtainedFor example [12] developed the robust estimation and thelocal influence analysis for regression model with SMSNdistribution From Bayesian point of view [13] consideredthe Bayesian estimation and the case influence diagnosticsfor nonlinear regression models with SMSN distributionsMore related literature could be found in [14ndash17] Generallyspeaking for model fitting of the nonlinear regression withskewed distributions a popular approach is to considerthe hierarchical representation of variables with a specificdistribution inwhich the postulated distribution is expressedas several conditional distributions of simpler forms suchas normal and Studentrsquos 119905 and Gamma Based on that EMalgorithm or Bayesian hierarchical approach then can beimplemented effectively for conducting model estimationand statistical inference

In this paper our aim is to develop an approach to like-lihood inference of nonlinear regression models with FSTNassumption As there is no stochastic representation for FSTNdistribution all the methods cited above become unavailablefor our considered problem and an alternativeway is to returnto the original Newton-Raphson iterative procedure formodel estimation Under the nonlinear regression paradigmthe accuracy of estimates is affected by the strength ofnonlinearity and the corresponding confidence interval andhypothesis test require the assumption of normality of theestimators or distribution which is too restrictive Besidesconsidering that in many practical applications rather thanthe total parameters we are usually interested in a propersubset of them By taking all these factors into accountin this paper we focus on the parameters of interest andpropose a modified Newton-Raphson iterative algorithm forcalculating the ML estimates based on profile likelihoodFurthermore the confidence interval and hypothesis test forthe parameters of interest are also considered We conductan application and a simulation study to compare the algo-rithm effectiveness and distribution robustness for nonlinearregression model in terms of fitting performance and modelselection The results from the numerical examples illustratethe usefulness and the superiority of our methodology

The remainder of this paper is organized as followsIn Section 2 we briefly discuss the FSS distribution andFSTN distributions In Section 3 we present the likelihoodinference including the quantities of the first- and the second-order derivatives as well as the standard Newton-Raphsoniterative formula In Section 4 we give an introductionof profile inference for our proposed model where theconfidence estimation and hypothesis test are presented tooSection 5 gives numerical examples using both simulatedand real data to illustrate the performance of the proposedmethodology Finally some concluding remarks are given inSection 6

2 Models and Notation

The class of skewed distributions such as SN ST and STNperform to be plausible for modeling skewness or (and)heavy tails underlying the observations The actual situationis that it is not rare at all to encounter multimodalitysometimes with an even more irregular shape and for thiscase the aforementioned distributions become unsufficientIn this paper with the adoption of a sufficiently flexibleclass of distributions we consider one of these extensionsreferred to as the family of flexible skew-symmetric (FSS)distributions which is introduced by [9] with the followingdensity function of type

119891 (119910 120583 120596120572) = 2120596minus1

1198910

(119911) 119866 119875119870

(119911) (1)

where 1198910

and 119866 are symmetric univariate density and dis-tribution function respectively that is 119891

0

(119909) = 1198910

(minus119909)119866(minus119909) = 1minus119866(119909) and 119875

119870

(119909) = 1205721

119909+1205723

1199093

+ sdot sdot sdot 1205722119870minus1

1199092119870minus1 is

an odd polynomial of degree 119870 (ie a polynomial includingonly terms of odd degree) 119911 = 120596

minus1

(119909 minus 120583) 120583 isin 119877 is thelocation parameter 120596 gt 0 is the scale parameter and 120572 =

(1205721

1205723

1205722119870minus1

) and1205721

1205723

1205722119870minus1

isin 119877 are shape param-eters

In general the density function of STN distribution canbe represented as 2120596

minus1

119905(119911 V)Φ(120572119911) where 119905 and Φ respec-tively denote the univariate standard Studentrsquos 119905 densityfunction and the univariate standard normal distributionfunction and V is the degrees of freedom The skewness isregulated by the shape parameter 120572 and the tail thickness ofthe distribution is controlled by V Commonly in comparisonwith the SN distribution the STN distribution exhibitsobvious feature of heavy tails when V le 10

In this paper we work with one version of (1) and thespecific definition can be presented as follows Let 119891

0

(119911) =

119905(119911 V) and 119866(119909) = Φ(119909) where Φ(119909) is defined as before and

119905 (119911 V) =Γ (V + 1) 2

120596 (V120587)12

Γ (V2)

(1 +1199112

V)

minus((V+1)119889)

(2)

that is the density function of univariate Studentrsquos 119905 distribu-tion with 0 location 120596 scale and V degrees of freedom Theabove extension of (1) is referred to as flexible skew-t-normal(FSTN) distribution denoted by FSTN(120583 120596120572 V)

It is noted that the FSTN distribution is proposed withinthe general framework of FSS distribution by combining with

Abstract and Applied Analysis 3

the definition of STN distribution and as a consequence itshares analogous feature with these two distributions Forall of that the FSTN distribution presents some interestingand peculiar features and is able to regulate the density ina more flexible way To be particular except for modelingskewness and tail thickness the FSTN distribution allows formultimodality depending on the specific setting of 120572 For thepurpose of comparison and illustration we assume 119891

0

(119911) =

120601(119911) and 119866(119911) = Φ(119911) in FSS distribution which is denotedby FSN and then we set 119870 = 2 in 119875

119870

(119911) that is 119875119870

(119909) =

1205721

119909 + 1205723

1199093 in STN distribution and for this case different

selections of 1205721

and 1205723

determine whether the density isunimodal or bimodal Moreover the same assumption for 119866

and 119875 in FSTN distribution is madeFigure 1 displays the density functions of FSN and STN

as well as FSTN distributions with four different situationsconsidered namely 120572

1

= 1 1205723

= 0 and V = 10 1205721

= 1205723

= 1

and V = 6 1205721

= 1 1205723

= minus1 and V = 4 1205721

= minus11205723

= 1 and V = 4 respectively with 120583 = 1 120596 = 15 for allcases By examination of Figure 1 we can detect how thesethree densities change with different combinations of 120572 andV For instance in Figure 1(a) FSN STN and FSTN appearto be very close while STN and FSTN are heavy tailed to alittle extent In Figure 1(b) both FSN and FSTN are unimodalwhen 120572

1

and 1205723

keep the same sign and the ranking for thedegree of skewness is STN FSTN and FSN in turn that isthe STN and FSTN distributions have thicker tails comparedto FSN distribution With opposite sign of 120572

1

and 1205723

bothFSN and FSTN distributions are bimodal and highly skewedin Figures 1(c) and 1(d) moreover in the same directionFSTN has thicker tails than FSN distribution Our proposedFSTN distribution can be treated as a proper compromisebetween the FSS distribution and the STN distribution Itallows for a wider range of tail behavior compared to FSSdistributionwhereas it is able to accommodatemultimodalitywhich cannot be described by STN distribution From theapplied viewpoint the FSTN distribution is an appealingoption which can be expected to yield robust inferentialresults in the presence of outlying observations

3 Likelihood Inference

Consider 119899 independent observations satisfying a nonlinearregression model as [119910

119894

] sim FSTN(120583119894

120596120572 V) with 120583119894

= 120578(x119894

120573) for 119894 = 1 2 119899 Here x

119894

is a 119901-dimensional vector and 120573is a 119901 times 1 vector of parameters Also let X = (x

1

x119899

)119879 be

the 119899 times 119901 design matrix 120578(sdot sdot) is a known twice differentiablefunction Then the corresponding log-likelihood for param-eter 120579 = (120573120572 120596 V) is given by

119897 (120579) = constant minus 119899 log120596 +

119899

sum

119894=1

log 119905 (119911119894

) +

119899

sum

119894=1

logΦ 119875119870

(119911119894

)

(3)And we have

120597119897 (120579)

120597120573= minus

1

120596

119899

sum

119894=1

(1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

))

120597120578119894

120597120573

120597119897 (120579)

120597120596= minus

1

120596

119899

sum

119894=1

(1199051015840

(119911119894

)

119905 (119911119894

)119911119894

+ 119911119894

1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)+ 119899)

120597119897 (120579)

120597V=

119899

sum

119894=1

1199112

119894

119905 (119911119894

)+

119899

2(119863119866 (

V + 1

2) minus 119863119866 (

V2

) minus1

2)

120597119897 (120579)

1205971205722119895minus1

=

119899

sum

119894=1

1199112119895minus1

119894

120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)1198751015840

119870

(119911119894

) 119895 = 1 2 119870

(4)

where 120578119894

= 120578(x119894

120573) and 119863119866(119909) = (119889[log Γ(119909)])119889119909 The cor-responding second-order derivatives of (3) can be shown as

1205972

119897 (120579)

120597120573120597120573119879

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

minus1

120596[

1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

1205972

119897 (120579)

120597120573120597120596=

119899

sum

119894=1

1

1205962

[1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

+119911119894

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

1205972

119897 (120579)

120597120573120597V= minus

1

120596

119899

sum

119894=1

[

(1 + Vminus1) 1199113

119894

minus 119905 (119911119894

) 1199112

119894

V21199052 (119911119894

)]

120597120578119894

120597120573

1205972

119897 (120579)

1205971205731205971205722119895minus1

=

119899

sum

119894=1

1199112119895minus1

119894

[1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573 119895 = 1 2 119870

1205972

119897 (120579)

1205971205962

=1

1205962

119899

sum

119894=1

[1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)] 3119911119894

+ 1199112

119894

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

119897 (120579)

120597120596120597V= minus

1

120596

119899

sum

119894=1

[

(1 + Vminus1) 1199113

119894

minus 119905 (119911119894

) 1199112

119894

V21199052 (119911119894

)]

1205972

119897 (120579)

1205971205961205971205722119895minus1

= minus

119899

sum

119894=1

[1

120596(1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)) 119911119894

]

times 1199112119895minus1

119894

119895 = 1 2 119870

4 Abstract and Applied Analysis

000

005

010

015

020

025

030

x

Density

FSNSTNFSTN

minus2 0 2 4 6

(a)

x

FSNSTNFSTN

000

005

010

015

020

025

030

035

Density

minus2 0 2 4 6

(b)

x

FSNSTNFSTN

minus2 0

000

005

010

015

020

025

030

Density

2 4 6

(c)

x

FSNSTNFSTN

000

005

010

015

020

025

030

Density

minus2 0 2 4 6

(d)

Figure 1 ((a)ndash(d)) Density functions of FSN STN and FSTN distributions with four different situations

1205972

119897 (120579)

120597V2=

119899

2[119879119866 (

V + 1

2) minus 119879119866 (

V2

)] +

119899

sum

119894=1

log (119905 (119911119894

))

1205972

119897 (120579)

1205971205722119895minus1

1205971205722119896minus1

=

119899

sum

119894=1

119911119895+119896

119894

119876 119875119870

(119911119894

) 119895 119896 = 1 2 119870

1205972

119897 (120579)

120597V1205971205722119895minus1

= 0 119895 = 1 2 119870

(5)

where 119879119866(119909) = (1198892

[log Γ(119909)])1198891199092 ℎ(119909) = (119905

10158401015840

(119909)119905(119909) minus

[1199051015840

(119909)]2

)1199052

(119909) and 119876(119909) = (120601(119909)Φ(119909))[119909 + (120601(119909)Φ(119909))]Assume119880(120579) = (119880

119879

(120573) 119880(120596) 119880(V) 119880119879(120572))119879 is the gradi-ent or score vector and 119867(120579) is the Hessian matrix composedof the above second-order derivatives To obtain the MLestimate of 120579 the Newton-Raphson iteration algorithm isdefined by

(119896+1)

= (119896)

minus [119867(120579)]

minus1

120579=

120579

(119896) [119880(120579)]120579=

120579

(119896) (6)

Abstract and Applied Analysis 5

It is noted that the above iterative procedure is an unpar-titioned algorithm that is all the parameters includingnonlinear regression coefficients 120573 scale parameter 120596 andshape parameter 120572 as well as tail thickness parameter V areestimated simultaneously For our considered problem thereare at least two difficulties that may be encountered for (6)the first one is that once the number of the parameters to beestimated becomes large the corresponding computationalburden turns to be heavy with an unacceptable estimationerror and the second one is as follows when the strength ofnonlinearity of the link function 120578(sdot sdot) changes the iterativeprocessmay becomeunstable or even nonconvergent leadingto the poor estimation results Considering the needs ofpractical problems rather than the total parameter set weare usually interested in a proper subset of it To improvethe efficiency of the algorithm and to facilitate statisticalinference of the nonlinearmodels with FSTNdistribution weput forward the following profile likelihoodmethod based on(3) and (6)

4 Profile Likelihood Inference

41 Profile Estimation Algorithm Let 120579 = (120573119879

120574119879

) be a parti-tion of 120579 where 120573 is a parameter vector of nonlinear regres-sion an interest parameter and 120574 is a 119902-dimension (119902 = 119870+

2) nuisance parameter with 120574 = (120572 120596 V)119879 Similarly thepartition of 119867 and 119880 is given as 119867(120579) = (

11986711 11986712

11986721 11986722) where

11986711

= (1205972

119897(120579)120597120573120597120573119879

) 11986712

= (1205972

119897(120579)120597120573120597120574119879

) = (1205972

119897(120579)

120597120573120597120596 1205972

119897(120579)120597120573120597V 1205972119897(120579)1205971205731205971205721

1205972

119897(120579)1205971205731205971205722119896minus1

) 11986712

= [11986721

]119879 and 119867

22

= (1205972

119897(120579) 120597120574120597120574119879

) and the diagonalelements of 119867

22

are given by 1205972

119897(120579) 1205971205962

1205972

119897(120579)120597V21205972

119897(120579)1205971205722

1

1205972

119897(120579)1205971205722

2119896minus1

respectively and the off-diagonal elements in 119867

22

can be obtained similarly Let119880(120579) = (119880

1

1198802

) corresponding to the partition of 120579In the subsequent context we focus on the estimation of120579 based on profile likelihood method [18] Firstly suppose 120573is known and we rewrite the original likelihood function (3)as

119897 (120579) = 119897 (120573 120574) = 119897120573

(120574) (7)

where notation 119897120573

(120574) denotes that 120573 is fixed but 120574 varies Foreach 120573 to estimate 120574 we can obtain

120573

= argmax120574

119897120573

(120574) (8)

Alternatively to estimate 120573 we evaluate the maximum valueof 119897120573

(120574) over 120573

and have

119901

= argmax120573

119897120573

(120573

) = argmax120573

119897 (120573 120573

) (9)

where 119897(120573 120573

) and 119901

are referred to as the profile likelihoodfunction and the profile ML estimation respectively

Following [19] we define the profile Newton-Raphsoniteration formula as follows

(119896+1)

119901

= (119896)

119901

minus [119867120573|120574

]minus1

(119880120573|120574

)

(119896+1)

119901

= (119896)

119901

minus ((11986722

)minus1

1198801

+ (11986722

)minus1

11986721

[119867120573|120574

]minus1

(119880120573|120574

))

(10)

where 119880120573|120574

= 1198801

minus 11986712

(11986722

)minus1

1198802

119867120573|120574

= 11986711

minus

11986712

(11986722

)minus1

11986721

and all the matrices and the vectors on theright hand side of (10) are evaluated at

(119896)

119901

and (119896)119901

Note that both in (6) and in (10) the strength of

nonlinearity of link function 120578(sdot sdot) is reflected by 1205972

120578119894

120597120573120597120573119879

to large extents Therefore by examination of the expressionof 1205972

119897(120579)120597120573120597120573119879 we find that when the element of

100381610038161003816100381610038161003816100381610038161003816

120596 [1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

100381610038161003816100381610038161003816100381610038161003816

(11)

is much less than the corresponding element of1003816100381610038161003816100381610038161003816100381610038161003816

120597120578119879

119894

120597120573[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573

100381610038161003816100381610038161003816100381610038161003816

(12)

the following approximating result can be obtained as

119867lowast

11

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

(13)

And then the iteration formulas of 119880lowast

120573|120574

[119867lowast

120573|120574

]minus1 119901

and

lowast

119901

can also be obtained just as before The above estimationprocedure is referred to as profile modified Newton-Raphsoniteration algorithm

The stopping rule for the above algorithm can bepresented as that iteration proceeds until some distanceinvolving two successive evaluations of the profile likeli-hood 119897(120573

120573

) such as |119897((119896+1)

120573

(119896+1)) minus 119897((119896)

120573

(119896))| or

|119897((119896+1)

120573

(119896+1))119897((119896)

120573

(119896))minus1| is small enough for example120576 lt 10

minus4 is adopted in this paperThe asymptotic covariancematrix of theML estimates for

profile likelihood can be evaluated by inverting the expectedinformation matrix however it does not have a closedform expression the observed information matrix 119869(120573) =

minus(1205972

119897(120573 120573

)120597120573120597120573119879

) can be used as a replacement which isestimated by 119867

11

minus 11986712

(11986722

)minus1

11986721

where 11986711

11986712

and 11986722

can be obtained as similar as aboveThe choice of the initial values plays an important role in

nonlinear regression fitting in this paper the specific steps

6 Abstract and Applied Analysis

for choosing the starting values are implemented as follows

(i) compute the initial value (0)

based on the nonlinearregression model with standard normal assumption

(ii) with (0)

fixed compute the initial values 120596(0) and

120572(0) for the SN and finite mixture SN assumptions

respectively

In order to simplify the estimation of parameter V wehave fixed integral values for V from 2 to 40 by one choose thevalue of V that maximizes the profile likelihood as V(0) andthen the initial values of

(0)

120596(0) 120572(0) V(0) that are requiredin the estimation procedure are all obtained

42 Profile Confidence Estimation and Hypothesis Test Con-fidence interval and hypothesis test play an important role instatistical inference and in the subsequent content we willconsider the profile confidence estimation andhypothesis testfor the parameters of interest in nonlinear regressionmodelsSuppose 119869(120579) = 119864[minus119867(120579)] the following regular conditionsfor likelihood inference are assumed

(R-i) int(120597 log 119871(x 120579)120597120579) sdot 119871(x 120579)119889x = 0

(R-ii) (120597120597120579) int 119871(x 120579) = int(120597119871(x 120579)120597120579)119889x = 0

(R-iii) (120597120597120579) int 119892(x)119871(x 120579)119889x = int 119892(x)(120597119871(x 120579)120597120579)119889xwhere 119892(x) is arbitrary measurable function

(R-iv) 119864(120597 log 119871(x 120579)120597120579)2

gt 0

Apart from the above assumptions in this paper someadditional conditions are assumed to hold for that

(A-i) 120578(x119894

120573) is twice continuously differentiable withrespect to 120573 isin 119877

119901(A-ii) 119863 = (120597120578

1

120597120573 120597120578119899

120597120573)119879 is full rank for all 120573

(A-iii) assume 1205790

is the true value of 120579 then there exist aneighbour region 119873

0

of 1205790

and a constant 119888 such that120582min(119869(120579)) gt 119888 for any 120579 isin 119873

0

Under the above assumption following [20] we have

2 119897 (119901

120573119901) minus 119897 (120573

0

1205730

) 997888rarr 1205942

(119901) (14)

where 1205730

is the true value of 120573 and the convergence is underthe meaning of convergence in probability Based on theprofile likelihood theory the confidence region of 120573 with theconfidence level 100(1 minus 120572) is given by

119862 (119884) = 120573 2 [119897 (119901

120573119901) minus 119897 (120573

120573

)] le 1205942

(119901 120572) (15)

where 1205942

(119901 120572) denotes the upper 120572 percentile of chi-squaredistribution with 119901 degrees of freedom Furthermore thehypothesis test

1198670

120573 = 1205730

against 1198671

120573 = 1205730

(16)

can also be considered and the corresponding test statisticsare presented by 2[119897(

120573

) minus 119897(1205730

1205730

)] Unlike standard

likelihood method profile likelihood confidence intervalsand hypothesis test do not need assumption of normality ofthe estimator or distribution which is too restrictive they arebased on an asymptotic chi-square distribution of the log pro-file likelihood ratio test statistics and these properties bring alot of convenience and feasibility in practical computation

5 Simulation Study

To investigate the experimental performance of our method-ology we undertake a simulation study to compare the fittingperformance of misspecified distribution as well as under-standing the large sample properties of the ML estimates Torealize this purpose we generate the artificial data from thefollowing two nonlinear models

Model (1) drug responsiveness model

119910119894

= 1205730

minus1205730

1 + (119909119894

1205731

)1205732

+ 120576119894

(17)

Model (2) nonlinear growth-curve model

119910119894

=1205731

1 + 1205732

exp (minus1205733

119909)+ 120576119894

(18)

with two kinds of skewed distributions for random error asfollows Case (I) 120576 sim STN(120583 120596 120572 V) and Case (II) 120576 sim

FSTN(120583 120596120572 V)The true parameter population is chosen as follows 120573 =

(2 5 minus08) is assumed to be the same for the above twomodels In Model (1) let 120583 = 120575

1

120596 = 15 120572 = 15 and V = 4

and in Model (2) let 120583 = 1205752

120596 = 15 120572 = (1 minus08) and V = 4Note that for the values of the degrees of freedom V a relativevalue (V = 4) can yield a heavy-tailed distribution as we need

In this simulation there are totally four simulated datasets corresponding to two nonlinear models Model (1) andModel (2) along with two distributions Case (I) and Case(II) for random error Similar to previous analysis eachsimulated data set is fitted under STN and FSN as well asFSTN scenarios using three different estimation algorithmsTo shed light on the experimental performance of ourmethodology an interesting comparison can be made byexamining how often we can recognize the true model

Table 1 shows the absolute value of the average biasbetween the true and the estimated parameters and thepercentages of each model being ranked as the best modelbased on AIC criterion out of 500 replications are alsopresented in Table 1

By examination of Table 1 we can find the following(i) the difference for the estimation results between drugresponsiveness model and growth-curve model is significantindicating that the nonlinearity of model imposes an impacton parameter estimation (ii) when the true distribution forrandom error is STN the three fitting distributions haveroughly the same behavior it is hard to tell which approachfor parameter estimation is good and which approach isbad and the similar results can be obtained for modelselection based on AIC (iii) when the true distribution forrandom error is FSTN the FSN and the FSTN distributions

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 3: Likelihood Inference of Nonlinear Models Based on a Class of

Abstract and Applied Analysis 3

the definition of STN distribution and as a consequence itshares analogous feature with these two distributions Forall of that the FSTN distribution presents some interestingand peculiar features and is able to regulate the density ina more flexible way To be particular except for modelingskewness and tail thickness the FSTN distribution allows formultimodality depending on the specific setting of 120572 For thepurpose of comparison and illustration we assume 119891

0

(119911) =

120601(119911) and 119866(119911) = Φ(119911) in FSS distribution which is denotedby FSN and then we set 119870 = 2 in 119875

119870

(119911) that is 119875119870

(119909) =

1205721

119909 + 1205723

1199093 in STN distribution and for this case different

selections of 1205721

and 1205723

determine whether the density isunimodal or bimodal Moreover the same assumption for 119866

and 119875 in FSTN distribution is madeFigure 1 displays the density functions of FSN and STN

as well as FSTN distributions with four different situationsconsidered namely 120572

1

= 1 1205723

= 0 and V = 10 1205721

= 1205723

= 1

and V = 6 1205721

= 1 1205723

= minus1 and V = 4 1205721

= minus11205723

= 1 and V = 4 respectively with 120583 = 1 120596 = 15 for allcases By examination of Figure 1 we can detect how thesethree densities change with different combinations of 120572 andV For instance in Figure 1(a) FSN STN and FSTN appearto be very close while STN and FSTN are heavy tailed to alittle extent In Figure 1(b) both FSN and FSTN are unimodalwhen 120572

1

and 1205723

keep the same sign and the ranking for thedegree of skewness is STN FSTN and FSN in turn that isthe STN and FSTN distributions have thicker tails comparedto FSN distribution With opposite sign of 120572

1

and 1205723

bothFSN and FSTN distributions are bimodal and highly skewedin Figures 1(c) and 1(d) moreover in the same directionFSTN has thicker tails than FSN distribution Our proposedFSTN distribution can be treated as a proper compromisebetween the FSS distribution and the STN distribution Itallows for a wider range of tail behavior compared to FSSdistributionwhereas it is able to accommodatemultimodalitywhich cannot be described by STN distribution From theapplied viewpoint the FSTN distribution is an appealingoption which can be expected to yield robust inferentialresults in the presence of outlying observations

3 Likelihood Inference

Consider 119899 independent observations satisfying a nonlinearregression model as [119910

119894

] sim FSTN(120583119894

120596120572 V) with 120583119894

= 120578(x119894

120573) for 119894 = 1 2 119899 Here x

119894

is a 119901-dimensional vector and 120573is a 119901 times 1 vector of parameters Also let X = (x

1

x119899

)119879 be

the 119899 times 119901 design matrix 120578(sdot sdot) is a known twice differentiablefunction Then the corresponding log-likelihood for param-eter 120579 = (120573120572 120596 V) is given by

119897 (120579) = constant minus 119899 log120596 +

119899

sum

119894=1

log 119905 (119911119894

) +

119899

sum

119894=1

logΦ 119875119870

(119911119894

)

(3)And we have

120597119897 (120579)

120597120573= minus

1

120596

119899

sum

119894=1

(1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

))

120597120578119894

120597120573

120597119897 (120579)

120597120596= minus

1

120596

119899

sum

119894=1

(1199051015840

(119911119894

)

119905 (119911119894

)119911119894

+ 119911119894

1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)+ 119899)

120597119897 (120579)

120597V=

119899

sum

119894=1

1199112

119894

119905 (119911119894

)+

119899

2(119863119866 (

V + 1

2) minus 119863119866 (

V2

) minus1

2)

120597119897 (120579)

1205971205722119895minus1

=

119899

sum

119894=1

1199112119895minus1

119894

120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)1198751015840

119870

(119911119894

) 119895 = 1 2 119870

(4)

where 120578119894

= 120578(x119894

120573) and 119863119866(119909) = (119889[log Γ(119909)])119889119909 The cor-responding second-order derivatives of (3) can be shown as

1205972

119897 (120579)

120597120573120597120573119879

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

minus1

120596[

1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

1205972

119897 (120579)

120597120573120597120596=

119899

sum

119894=1

1

1205962

[1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

+119911119894

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

1205972

119897 (120579)

120597120573120597V= minus

1

120596

119899

sum

119894=1

[

(1 + Vminus1) 1199113

119894

minus 119905 (119911119894

) 1199112

119894

V21199052 (119911119894

)]

120597120578119894

120597120573

1205972

119897 (120579)

1205971205731205971205722119895minus1

=

119899

sum

119894=1

1199112119895minus1

119894

[1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573 119895 = 1 2 119870

1205972

119897 (120579)

1205971205962

=1

1205962

119899

sum

119894=1

[1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)] 3119911119894

+ 1199112

119894

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

119897 (120579)

120597120596120597V= minus

1

120596

119899

sum

119894=1

[

(1 + Vminus1) 1199113

119894

minus 119905 (119911119894

) 1199112

119894

V21199052 (119911119894

)]

1205972

119897 (120579)

1205971205961205971205722119895minus1

= minus

119899

sum

119894=1

[1

120596(1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)) 119911119894

]

times 1199112119895minus1

119894

119895 = 1 2 119870

4 Abstract and Applied Analysis

000

005

010

015

020

025

030

x

Density

FSNSTNFSTN

minus2 0 2 4 6

(a)

x

FSNSTNFSTN

000

005

010

015

020

025

030

035

Density

minus2 0 2 4 6

(b)

x

FSNSTNFSTN

minus2 0

000

005

010

015

020

025

030

Density

2 4 6

(c)

x

FSNSTNFSTN

000

005

010

015

020

025

030

Density

minus2 0 2 4 6

(d)

Figure 1 ((a)ndash(d)) Density functions of FSN STN and FSTN distributions with four different situations

1205972

119897 (120579)

120597V2=

119899

2[119879119866 (

V + 1

2) minus 119879119866 (

V2

)] +

119899

sum

119894=1

log (119905 (119911119894

))

1205972

119897 (120579)

1205971205722119895minus1

1205971205722119896minus1

=

119899

sum

119894=1

119911119895+119896

119894

119876 119875119870

(119911119894

) 119895 119896 = 1 2 119870

1205972

119897 (120579)

120597V1205971205722119895minus1

= 0 119895 = 1 2 119870

(5)

where 119879119866(119909) = (1198892

[log Γ(119909)])1198891199092 ℎ(119909) = (119905

10158401015840

(119909)119905(119909) minus

[1199051015840

(119909)]2

)1199052

(119909) and 119876(119909) = (120601(119909)Φ(119909))[119909 + (120601(119909)Φ(119909))]Assume119880(120579) = (119880

119879

(120573) 119880(120596) 119880(V) 119880119879(120572))119879 is the gradi-ent or score vector and 119867(120579) is the Hessian matrix composedof the above second-order derivatives To obtain the MLestimate of 120579 the Newton-Raphson iteration algorithm isdefined by

(119896+1)

= (119896)

minus [119867(120579)]

minus1

120579=

120579

(119896) [119880(120579)]120579=

120579

(119896) (6)

Abstract and Applied Analysis 5

It is noted that the above iterative procedure is an unpar-titioned algorithm that is all the parameters includingnonlinear regression coefficients 120573 scale parameter 120596 andshape parameter 120572 as well as tail thickness parameter V areestimated simultaneously For our considered problem thereare at least two difficulties that may be encountered for (6)the first one is that once the number of the parameters to beestimated becomes large the corresponding computationalburden turns to be heavy with an unacceptable estimationerror and the second one is as follows when the strength ofnonlinearity of the link function 120578(sdot sdot) changes the iterativeprocessmay becomeunstable or even nonconvergent leadingto the poor estimation results Considering the needs ofpractical problems rather than the total parameter set weare usually interested in a proper subset of it To improvethe efficiency of the algorithm and to facilitate statisticalinference of the nonlinearmodels with FSTNdistribution weput forward the following profile likelihoodmethod based on(3) and (6)

4 Profile Likelihood Inference

41 Profile Estimation Algorithm Let 120579 = (120573119879

120574119879

) be a parti-tion of 120579 where 120573 is a parameter vector of nonlinear regres-sion an interest parameter and 120574 is a 119902-dimension (119902 = 119870+

2) nuisance parameter with 120574 = (120572 120596 V)119879 Similarly thepartition of 119867 and 119880 is given as 119867(120579) = (

11986711 11986712

11986721 11986722) where

11986711

= (1205972

119897(120579)120597120573120597120573119879

) 11986712

= (1205972

119897(120579)120597120573120597120574119879

) = (1205972

119897(120579)

120597120573120597120596 1205972

119897(120579)120597120573120597V 1205972119897(120579)1205971205731205971205721

1205972

119897(120579)1205971205731205971205722119896minus1

) 11986712

= [11986721

]119879 and 119867

22

= (1205972

119897(120579) 120597120574120597120574119879

) and the diagonalelements of 119867

22

are given by 1205972

119897(120579) 1205971205962

1205972

119897(120579)120597V21205972

119897(120579)1205971205722

1

1205972

119897(120579)1205971205722

2119896minus1

respectively and the off-diagonal elements in 119867

22

can be obtained similarly Let119880(120579) = (119880

1

1198802

) corresponding to the partition of 120579In the subsequent context we focus on the estimation of120579 based on profile likelihood method [18] Firstly suppose 120573is known and we rewrite the original likelihood function (3)as

119897 (120579) = 119897 (120573 120574) = 119897120573

(120574) (7)

where notation 119897120573

(120574) denotes that 120573 is fixed but 120574 varies Foreach 120573 to estimate 120574 we can obtain

120573

= argmax120574

119897120573

(120574) (8)

Alternatively to estimate 120573 we evaluate the maximum valueof 119897120573

(120574) over 120573

and have

119901

= argmax120573

119897120573

(120573

) = argmax120573

119897 (120573 120573

) (9)

where 119897(120573 120573

) and 119901

are referred to as the profile likelihoodfunction and the profile ML estimation respectively

Following [19] we define the profile Newton-Raphsoniteration formula as follows

(119896+1)

119901

= (119896)

119901

minus [119867120573|120574

]minus1

(119880120573|120574

)

(119896+1)

119901

= (119896)

119901

minus ((11986722

)minus1

1198801

+ (11986722

)minus1

11986721

[119867120573|120574

]minus1

(119880120573|120574

))

(10)

where 119880120573|120574

= 1198801

minus 11986712

(11986722

)minus1

1198802

119867120573|120574

= 11986711

minus

11986712

(11986722

)minus1

11986721

and all the matrices and the vectors on theright hand side of (10) are evaluated at

(119896)

119901

and (119896)119901

Note that both in (6) and in (10) the strength of

nonlinearity of link function 120578(sdot sdot) is reflected by 1205972

120578119894

120597120573120597120573119879

to large extents Therefore by examination of the expressionof 1205972

119897(120579)120597120573120597120573119879 we find that when the element of

100381610038161003816100381610038161003816100381610038161003816

120596 [1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

100381610038161003816100381610038161003816100381610038161003816

(11)

is much less than the corresponding element of1003816100381610038161003816100381610038161003816100381610038161003816

120597120578119879

119894

120597120573[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573

100381610038161003816100381610038161003816100381610038161003816

(12)

the following approximating result can be obtained as

119867lowast

11

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

(13)

And then the iteration formulas of 119880lowast

120573|120574

[119867lowast

120573|120574

]minus1 119901

and

lowast

119901

can also be obtained just as before The above estimationprocedure is referred to as profile modified Newton-Raphsoniteration algorithm

The stopping rule for the above algorithm can bepresented as that iteration proceeds until some distanceinvolving two successive evaluations of the profile likeli-hood 119897(120573

120573

) such as |119897((119896+1)

120573

(119896+1)) minus 119897((119896)

120573

(119896))| or

|119897((119896+1)

120573

(119896+1))119897((119896)

120573

(119896))minus1| is small enough for example120576 lt 10

minus4 is adopted in this paperThe asymptotic covariancematrix of theML estimates for

profile likelihood can be evaluated by inverting the expectedinformation matrix however it does not have a closedform expression the observed information matrix 119869(120573) =

minus(1205972

119897(120573 120573

)120597120573120597120573119879

) can be used as a replacement which isestimated by 119867

11

minus 11986712

(11986722

)minus1

11986721

where 11986711

11986712

and 11986722

can be obtained as similar as aboveThe choice of the initial values plays an important role in

nonlinear regression fitting in this paper the specific steps

6 Abstract and Applied Analysis

for choosing the starting values are implemented as follows

(i) compute the initial value (0)

based on the nonlinearregression model with standard normal assumption

(ii) with (0)

fixed compute the initial values 120596(0) and

120572(0) for the SN and finite mixture SN assumptions

respectively

In order to simplify the estimation of parameter V wehave fixed integral values for V from 2 to 40 by one choose thevalue of V that maximizes the profile likelihood as V(0) andthen the initial values of

(0)

120596(0) 120572(0) V(0) that are requiredin the estimation procedure are all obtained

42 Profile Confidence Estimation and Hypothesis Test Con-fidence interval and hypothesis test play an important role instatistical inference and in the subsequent content we willconsider the profile confidence estimation andhypothesis testfor the parameters of interest in nonlinear regressionmodelsSuppose 119869(120579) = 119864[minus119867(120579)] the following regular conditionsfor likelihood inference are assumed

(R-i) int(120597 log 119871(x 120579)120597120579) sdot 119871(x 120579)119889x = 0

(R-ii) (120597120597120579) int 119871(x 120579) = int(120597119871(x 120579)120597120579)119889x = 0

(R-iii) (120597120597120579) int 119892(x)119871(x 120579)119889x = int 119892(x)(120597119871(x 120579)120597120579)119889xwhere 119892(x) is arbitrary measurable function

(R-iv) 119864(120597 log 119871(x 120579)120597120579)2

gt 0

Apart from the above assumptions in this paper someadditional conditions are assumed to hold for that

(A-i) 120578(x119894

120573) is twice continuously differentiable withrespect to 120573 isin 119877

119901(A-ii) 119863 = (120597120578

1

120597120573 120597120578119899

120597120573)119879 is full rank for all 120573

(A-iii) assume 1205790

is the true value of 120579 then there exist aneighbour region 119873

0

of 1205790

and a constant 119888 such that120582min(119869(120579)) gt 119888 for any 120579 isin 119873

0

Under the above assumption following [20] we have

2 119897 (119901

120573119901) minus 119897 (120573

0

1205730

) 997888rarr 1205942

(119901) (14)

where 1205730

is the true value of 120573 and the convergence is underthe meaning of convergence in probability Based on theprofile likelihood theory the confidence region of 120573 with theconfidence level 100(1 minus 120572) is given by

119862 (119884) = 120573 2 [119897 (119901

120573119901) minus 119897 (120573

120573

)] le 1205942

(119901 120572) (15)

where 1205942

(119901 120572) denotes the upper 120572 percentile of chi-squaredistribution with 119901 degrees of freedom Furthermore thehypothesis test

1198670

120573 = 1205730

against 1198671

120573 = 1205730

(16)

can also be considered and the corresponding test statisticsare presented by 2[119897(

120573

) minus 119897(1205730

1205730

)] Unlike standard

likelihood method profile likelihood confidence intervalsand hypothesis test do not need assumption of normality ofthe estimator or distribution which is too restrictive they arebased on an asymptotic chi-square distribution of the log pro-file likelihood ratio test statistics and these properties bring alot of convenience and feasibility in practical computation

5 Simulation Study

To investigate the experimental performance of our method-ology we undertake a simulation study to compare the fittingperformance of misspecified distribution as well as under-standing the large sample properties of the ML estimates Torealize this purpose we generate the artificial data from thefollowing two nonlinear models

Model (1) drug responsiveness model

119910119894

= 1205730

minus1205730

1 + (119909119894

1205731

)1205732

+ 120576119894

(17)

Model (2) nonlinear growth-curve model

119910119894

=1205731

1 + 1205732

exp (minus1205733

119909)+ 120576119894

(18)

with two kinds of skewed distributions for random error asfollows Case (I) 120576 sim STN(120583 120596 120572 V) and Case (II) 120576 sim

FSTN(120583 120596120572 V)The true parameter population is chosen as follows 120573 =

(2 5 minus08) is assumed to be the same for the above twomodels In Model (1) let 120583 = 120575

1

120596 = 15 120572 = 15 and V = 4

and in Model (2) let 120583 = 1205752

120596 = 15 120572 = (1 minus08) and V = 4Note that for the values of the degrees of freedom V a relativevalue (V = 4) can yield a heavy-tailed distribution as we need

In this simulation there are totally four simulated datasets corresponding to two nonlinear models Model (1) andModel (2) along with two distributions Case (I) and Case(II) for random error Similar to previous analysis eachsimulated data set is fitted under STN and FSN as well asFSTN scenarios using three different estimation algorithmsTo shed light on the experimental performance of ourmethodology an interesting comparison can be made byexamining how often we can recognize the true model

Table 1 shows the absolute value of the average biasbetween the true and the estimated parameters and thepercentages of each model being ranked as the best modelbased on AIC criterion out of 500 replications are alsopresented in Table 1

By examination of Table 1 we can find the following(i) the difference for the estimation results between drugresponsiveness model and growth-curve model is significantindicating that the nonlinearity of model imposes an impacton parameter estimation (ii) when the true distribution forrandom error is STN the three fitting distributions haveroughly the same behavior it is hard to tell which approachfor parameter estimation is good and which approach isbad and the similar results can be obtained for modelselection based on AIC (iii) when the true distribution forrandom error is FSTN the FSN and the FSTN distributions

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 4: Likelihood Inference of Nonlinear Models Based on a Class of

4 Abstract and Applied Analysis

000

005

010

015

020

025

030

x

Density

FSNSTNFSTN

minus2 0 2 4 6

(a)

x

FSNSTNFSTN

000

005

010

015

020

025

030

035

Density

minus2 0 2 4 6

(b)

x

FSNSTNFSTN

minus2 0

000

005

010

015

020

025

030

Density

2 4 6

(c)

x

FSNSTNFSTN

000

005

010

015

020

025

030

Density

minus2 0 2 4 6

(d)

Figure 1 ((a)ndash(d)) Density functions of FSN STN and FSTN distributions with four different situations

1205972

119897 (120579)

120597V2=

119899

2[119879119866 (

V + 1

2) minus 119879119866 (

V2

)] +

119899

sum

119894=1

log (119905 (119911119894

))

1205972

119897 (120579)

1205971205722119895minus1

1205971205722119896minus1

=

119899

sum

119894=1

119911119895+119896

119894

119876 119875119870

(119911119894

) 119895 119896 = 1 2 119870

1205972

119897 (120579)

120597V1205971205722119895minus1

= 0 119895 = 1 2 119870

(5)

where 119879119866(119909) = (1198892

[log Γ(119909)])1198891199092 ℎ(119909) = (119905

10158401015840

(119909)119905(119909) minus

[1199051015840

(119909)]2

)1199052

(119909) and 119876(119909) = (120601(119909)Φ(119909))[119909 + (120601(119909)Φ(119909))]Assume119880(120579) = (119880

119879

(120573) 119880(120596) 119880(V) 119880119879(120572))119879 is the gradi-ent or score vector and 119867(120579) is the Hessian matrix composedof the above second-order derivatives To obtain the MLestimate of 120579 the Newton-Raphson iteration algorithm isdefined by

(119896+1)

= (119896)

minus [119867(120579)]

minus1

120579=

120579

(119896) [119880(120579)]120579=

120579

(119896) (6)

Abstract and Applied Analysis 5

It is noted that the above iterative procedure is an unpar-titioned algorithm that is all the parameters includingnonlinear regression coefficients 120573 scale parameter 120596 andshape parameter 120572 as well as tail thickness parameter V areestimated simultaneously For our considered problem thereare at least two difficulties that may be encountered for (6)the first one is that once the number of the parameters to beestimated becomes large the corresponding computationalburden turns to be heavy with an unacceptable estimationerror and the second one is as follows when the strength ofnonlinearity of the link function 120578(sdot sdot) changes the iterativeprocessmay becomeunstable or even nonconvergent leadingto the poor estimation results Considering the needs ofpractical problems rather than the total parameter set weare usually interested in a proper subset of it To improvethe efficiency of the algorithm and to facilitate statisticalinference of the nonlinearmodels with FSTNdistribution weput forward the following profile likelihoodmethod based on(3) and (6)

4 Profile Likelihood Inference

41 Profile Estimation Algorithm Let 120579 = (120573119879

120574119879

) be a parti-tion of 120579 where 120573 is a parameter vector of nonlinear regres-sion an interest parameter and 120574 is a 119902-dimension (119902 = 119870+

2) nuisance parameter with 120574 = (120572 120596 V)119879 Similarly thepartition of 119867 and 119880 is given as 119867(120579) = (

11986711 11986712

11986721 11986722) where

11986711

= (1205972

119897(120579)120597120573120597120573119879

) 11986712

= (1205972

119897(120579)120597120573120597120574119879

) = (1205972

119897(120579)

120597120573120597120596 1205972

119897(120579)120597120573120597V 1205972119897(120579)1205971205731205971205721

1205972

119897(120579)1205971205731205971205722119896minus1

) 11986712

= [11986721

]119879 and 119867

22

= (1205972

119897(120579) 120597120574120597120574119879

) and the diagonalelements of 119867

22

are given by 1205972

119897(120579) 1205971205962

1205972

119897(120579)120597V21205972

119897(120579)1205971205722

1

1205972

119897(120579)1205971205722

2119896minus1

respectively and the off-diagonal elements in 119867

22

can be obtained similarly Let119880(120579) = (119880

1

1198802

) corresponding to the partition of 120579In the subsequent context we focus on the estimation of120579 based on profile likelihood method [18] Firstly suppose 120573is known and we rewrite the original likelihood function (3)as

119897 (120579) = 119897 (120573 120574) = 119897120573

(120574) (7)

where notation 119897120573

(120574) denotes that 120573 is fixed but 120574 varies Foreach 120573 to estimate 120574 we can obtain

120573

= argmax120574

119897120573

(120574) (8)

Alternatively to estimate 120573 we evaluate the maximum valueof 119897120573

(120574) over 120573

and have

119901

= argmax120573

119897120573

(120573

) = argmax120573

119897 (120573 120573

) (9)

where 119897(120573 120573

) and 119901

are referred to as the profile likelihoodfunction and the profile ML estimation respectively

Following [19] we define the profile Newton-Raphsoniteration formula as follows

(119896+1)

119901

= (119896)

119901

minus [119867120573|120574

]minus1

(119880120573|120574

)

(119896+1)

119901

= (119896)

119901

minus ((11986722

)minus1

1198801

+ (11986722

)minus1

11986721

[119867120573|120574

]minus1

(119880120573|120574

))

(10)

where 119880120573|120574

= 1198801

minus 11986712

(11986722

)minus1

1198802

119867120573|120574

= 11986711

minus

11986712

(11986722

)minus1

11986721

and all the matrices and the vectors on theright hand side of (10) are evaluated at

(119896)

119901

and (119896)119901

Note that both in (6) and in (10) the strength of

nonlinearity of link function 120578(sdot sdot) is reflected by 1205972

120578119894

120597120573120597120573119879

to large extents Therefore by examination of the expressionof 1205972

119897(120579)120597120573120597120573119879 we find that when the element of

100381610038161003816100381610038161003816100381610038161003816

120596 [1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

100381610038161003816100381610038161003816100381610038161003816

(11)

is much less than the corresponding element of1003816100381610038161003816100381610038161003816100381610038161003816

120597120578119879

119894

120597120573[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573

100381610038161003816100381610038161003816100381610038161003816

(12)

the following approximating result can be obtained as

119867lowast

11

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

(13)

And then the iteration formulas of 119880lowast

120573|120574

[119867lowast

120573|120574

]minus1 119901

and

lowast

119901

can also be obtained just as before The above estimationprocedure is referred to as profile modified Newton-Raphsoniteration algorithm

The stopping rule for the above algorithm can bepresented as that iteration proceeds until some distanceinvolving two successive evaluations of the profile likeli-hood 119897(120573

120573

) such as |119897((119896+1)

120573

(119896+1)) minus 119897((119896)

120573

(119896))| or

|119897((119896+1)

120573

(119896+1))119897((119896)

120573

(119896))minus1| is small enough for example120576 lt 10

minus4 is adopted in this paperThe asymptotic covariancematrix of theML estimates for

profile likelihood can be evaluated by inverting the expectedinformation matrix however it does not have a closedform expression the observed information matrix 119869(120573) =

minus(1205972

119897(120573 120573

)120597120573120597120573119879

) can be used as a replacement which isestimated by 119867

11

minus 11986712

(11986722

)minus1

11986721

where 11986711

11986712

and 11986722

can be obtained as similar as aboveThe choice of the initial values plays an important role in

nonlinear regression fitting in this paper the specific steps

6 Abstract and Applied Analysis

for choosing the starting values are implemented as follows

(i) compute the initial value (0)

based on the nonlinearregression model with standard normal assumption

(ii) with (0)

fixed compute the initial values 120596(0) and

120572(0) for the SN and finite mixture SN assumptions

respectively

In order to simplify the estimation of parameter V wehave fixed integral values for V from 2 to 40 by one choose thevalue of V that maximizes the profile likelihood as V(0) andthen the initial values of

(0)

120596(0) 120572(0) V(0) that are requiredin the estimation procedure are all obtained

42 Profile Confidence Estimation and Hypothesis Test Con-fidence interval and hypothesis test play an important role instatistical inference and in the subsequent content we willconsider the profile confidence estimation andhypothesis testfor the parameters of interest in nonlinear regressionmodelsSuppose 119869(120579) = 119864[minus119867(120579)] the following regular conditionsfor likelihood inference are assumed

(R-i) int(120597 log 119871(x 120579)120597120579) sdot 119871(x 120579)119889x = 0

(R-ii) (120597120597120579) int 119871(x 120579) = int(120597119871(x 120579)120597120579)119889x = 0

(R-iii) (120597120597120579) int 119892(x)119871(x 120579)119889x = int 119892(x)(120597119871(x 120579)120597120579)119889xwhere 119892(x) is arbitrary measurable function

(R-iv) 119864(120597 log 119871(x 120579)120597120579)2

gt 0

Apart from the above assumptions in this paper someadditional conditions are assumed to hold for that

(A-i) 120578(x119894

120573) is twice continuously differentiable withrespect to 120573 isin 119877

119901(A-ii) 119863 = (120597120578

1

120597120573 120597120578119899

120597120573)119879 is full rank for all 120573

(A-iii) assume 1205790

is the true value of 120579 then there exist aneighbour region 119873

0

of 1205790

and a constant 119888 such that120582min(119869(120579)) gt 119888 for any 120579 isin 119873

0

Under the above assumption following [20] we have

2 119897 (119901

120573119901) minus 119897 (120573

0

1205730

) 997888rarr 1205942

(119901) (14)

where 1205730

is the true value of 120573 and the convergence is underthe meaning of convergence in probability Based on theprofile likelihood theory the confidence region of 120573 with theconfidence level 100(1 minus 120572) is given by

119862 (119884) = 120573 2 [119897 (119901

120573119901) minus 119897 (120573

120573

)] le 1205942

(119901 120572) (15)

where 1205942

(119901 120572) denotes the upper 120572 percentile of chi-squaredistribution with 119901 degrees of freedom Furthermore thehypothesis test

1198670

120573 = 1205730

against 1198671

120573 = 1205730

(16)

can also be considered and the corresponding test statisticsare presented by 2[119897(

120573

) minus 119897(1205730

1205730

)] Unlike standard

likelihood method profile likelihood confidence intervalsand hypothesis test do not need assumption of normality ofthe estimator or distribution which is too restrictive they arebased on an asymptotic chi-square distribution of the log pro-file likelihood ratio test statistics and these properties bring alot of convenience and feasibility in practical computation

5 Simulation Study

To investigate the experimental performance of our method-ology we undertake a simulation study to compare the fittingperformance of misspecified distribution as well as under-standing the large sample properties of the ML estimates Torealize this purpose we generate the artificial data from thefollowing two nonlinear models

Model (1) drug responsiveness model

119910119894

= 1205730

minus1205730

1 + (119909119894

1205731

)1205732

+ 120576119894

(17)

Model (2) nonlinear growth-curve model

119910119894

=1205731

1 + 1205732

exp (minus1205733

119909)+ 120576119894

(18)

with two kinds of skewed distributions for random error asfollows Case (I) 120576 sim STN(120583 120596 120572 V) and Case (II) 120576 sim

FSTN(120583 120596120572 V)The true parameter population is chosen as follows 120573 =

(2 5 minus08) is assumed to be the same for the above twomodels In Model (1) let 120583 = 120575

1

120596 = 15 120572 = 15 and V = 4

and in Model (2) let 120583 = 1205752

120596 = 15 120572 = (1 minus08) and V = 4Note that for the values of the degrees of freedom V a relativevalue (V = 4) can yield a heavy-tailed distribution as we need

In this simulation there are totally four simulated datasets corresponding to two nonlinear models Model (1) andModel (2) along with two distributions Case (I) and Case(II) for random error Similar to previous analysis eachsimulated data set is fitted under STN and FSN as well asFSTN scenarios using three different estimation algorithmsTo shed light on the experimental performance of ourmethodology an interesting comparison can be made byexamining how often we can recognize the true model

Table 1 shows the absolute value of the average biasbetween the true and the estimated parameters and thepercentages of each model being ranked as the best modelbased on AIC criterion out of 500 replications are alsopresented in Table 1

By examination of Table 1 we can find the following(i) the difference for the estimation results between drugresponsiveness model and growth-curve model is significantindicating that the nonlinearity of model imposes an impacton parameter estimation (ii) when the true distribution forrandom error is STN the three fitting distributions haveroughly the same behavior it is hard to tell which approachfor parameter estimation is good and which approach isbad and the similar results can be obtained for modelselection based on AIC (iii) when the true distribution forrandom error is FSTN the FSN and the FSTN distributions

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 5: Likelihood Inference of Nonlinear Models Based on a Class of

Abstract and Applied Analysis 5

It is noted that the above iterative procedure is an unpar-titioned algorithm that is all the parameters includingnonlinear regression coefficients 120573 scale parameter 120596 andshape parameter 120572 as well as tail thickness parameter V areestimated simultaneously For our considered problem thereare at least two difficulties that may be encountered for (6)the first one is that once the number of the parameters to beestimated becomes large the corresponding computationalburden turns to be heavy with an unacceptable estimationerror and the second one is as follows when the strength ofnonlinearity of the link function 120578(sdot sdot) changes the iterativeprocessmay becomeunstable or even nonconvergent leadingto the poor estimation results Considering the needs ofpractical problems rather than the total parameter set weare usually interested in a proper subset of it To improvethe efficiency of the algorithm and to facilitate statisticalinference of the nonlinearmodels with FSTNdistribution weput forward the following profile likelihoodmethod based on(3) and (6)

4 Profile Likelihood Inference

41 Profile Estimation Algorithm Let 120579 = (120573119879

120574119879

) be a parti-tion of 120579 where 120573 is a parameter vector of nonlinear regres-sion an interest parameter and 120574 is a 119902-dimension (119902 = 119870+

2) nuisance parameter with 120574 = (120572 120596 V)119879 Similarly thepartition of 119867 and 119880 is given as 119867(120579) = (

11986711 11986712

11986721 11986722) where

11986711

= (1205972

119897(120579)120597120573120597120573119879

) 11986712

= (1205972

119897(120579)120597120573120597120574119879

) = (1205972

119897(120579)

120597120573120597120596 1205972

119897(120579)120597120573120597V 1205972119897(120579)1205971205731205971205721

1205972

119897(120579)1205971205731205971205722119896minus1

) 11986712

= [11986721

]119879 and 119867

22

= (1205972

119897(120579) 120597120574120597120574119879

) and the diagonalelements of 119867

22

are given by 1205972

119897(120579) 1205971205962

1205972

119897(120579)120597V21205972

119897(120579)1205971205722

1

1205972

119897(120579)1205971205722

2119896minus1

respectively and the off-diagonal elements in 119867

22

can be obtained similarly Let119880(120579) = (119880

1

1198802

) corresponding to the partition of 120579In the subsequent context we focus on the estimation of120579 based on profile likelihood method [18] Firstly suppose 120573is known and we rewrite the original likelihood function (3)as

119897 (120579) = 119897 (120573 120574) = 119897120573

(120574) (7)

where notation 119897120573

(120574) denotes that 120573 is fixed but 120574 varies Foreach 120573 to estimate 120574 we can obtain

120573

= argmax120574

119897120573

(120574) (8)

Alternatively to estimate 120573 we evaluate the maximum valueof 119897120573

(120574) over 120573

and have

119901

= argmax120573

119897120573

(120573

) = argmax120573

119897 (120573 120573

) (9)

where 119897(120573 120573

) and 119901

are referred to as the profile likelihoodfunction and the profile ML estimation respectively

Following [19] we define the profile Newton-Raphsoniteration formula as follows

(119896+1)

119901

= (119896)

119901

minus [119867120573|120574

]minus1

(119880120573|120574

)

(119896+1)

119901

= (119896)

119901

minus ((11986722

)minus1

1198801

+ (11986722

)minus1

11986721

[119867120573|120574

]minus1

(119880120573|120574

))

(10)

where 119880120573|120574

= 1198801

minus 11986712

(11986722

)minus1

1198802

119867120573|120574

= 11986711

minus

11986712

(11986722

)minus1

11986721

and all the matrices and the vectors on theright hand side of (10) are evaluated at

(119896)

119901

and (119896)119901

Note that both in (6) and in (10) the strength of

nonlinearity of link function 120578(sdot sdot) is reflected by 1205972

120578119894

120597120573120597120573119879

to large extents Therefore by examination of the expressionof 1205972

119897(120579)120597120573120597120573119879 we find that when the element of

100381610038161003816100381610038161003816100381610038161003816

120596 [1199051015840

(119911119894

)

119905 (119911119894

)+ 1198751015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

1205972

120578119894

120597120573120597120573119879

100381610038161003816100381610038161003816100381610038161003816

(11)

is much less than the corresponding element of1003816100381610038161003816100381610038161003816100381610038161003816

120597120578119879

119894

120597120573[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

) + 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

times120597120578119894

120597120573

100381610038161003816100381610038161003816100381610038161003816

(12)

the following approximating result can be obtained as

119867lowast

11

=

119899

sum

119894=1

(120597120578119894

120597120573)

119879

1

1205962

[ℎ (119911119894

) + 1198751015840

119870

(119911119894

) 119876 119875119870

(119911119894

)

+ 11987510158401015840

119870

(119911119894

)120601 119875119870

(119911119894

)

Φ 119875119870

(119911119894

)]

120597120578119894

120597120573

(13)

And then the iteration formulas of 119880lowast

120573|120574

[119867lowast

120573|120574

]minus1 119901

and

lowast

119901

can also be obtained just as before The above estimationprocedure is referred to as profile modified Newton-Raphsoniteration algorithm

The stopping rule for the above algorithm can bepresented as that iteration proceeds until some distanceinvolving two successive evaluations of the profile likeli-hood 119897(120573

120573

) such as |119897((119896+1)

120573

(119896+1)) minus 119897((119896)

120573

(119896))| or

|119897((119896+1)

120573

(119896+1))119897((119896)

120573

(119896))minus1| is small enough for example120576 lt 10

minus4 is adopted in this paperThe asymptotic covariancematrix of theML estimates for

profile likelihood can be evaluated by inverting the expectedinformation matrix however it does not have a closedform expression the observed information matrix 119869(120573) =

minus(1205972

119897(120573 120573

)120597120573120597120573119879

) can be used as a replacement which isestimated by 119867

11

minus 11986712

(11986722

)minus1

11986721

where 11986711

11986712

and 11986722

can be obtained as similar as aboveThe choice of the initial values plays an important role in

nonlinear regression fitting in this paper the specific steps

6 Abstract and Applied Analysis

for choosing the starting values are implemented as follows

(i) compute the initial value (0)

based on the nonlinearregression model with standard normal assumption

(ii) with (0)

fixed compute the initial values 120596(0) and

120572(0) for the SN and finite mixture SN assumptions

respectively

In order to simplify the estimation of parameter V wehave fixed integral values for V from 2 to 40 by one choose thevalue of V that maximizes the profile likelihood as V(0) andthen the initial values of

(0)

120596(0) 120572(0) V(0) that are requiredin the estimation procedure are all obtained

42 Profile Confidence Estimation and Hypothesis Test Con-fidence interval and hypothesis test play an important role instatistical inference and in the subsequent content we willconsider the profile confidence estimation andhypothesis testfor the parameters of interest in nonlinear regressionmodelsSuppose 119869(120579) = 119864[minus119867(120579)] the following regular conditionsfor likelihood inference are assumed

(R-i) int(120597 log 119871(x 120579)120597120579) sdot 119871(x 120579)119889x = 0

(R-ii) (120597120597120579) int 119871(x 120579) = int(120597119871(x 120579)120597120579)119889x = 0

(R-iii) (120597120597120579) int 119892(x)119871(x 120579)119889x = int 119892(x)(120597119871(x 120579)120597120579)119889xwhere 119892(x) is arbitrary measurable function

(R-iv) 119864(120597 log 119871(x 120579)120597120579)2

gt 0

Apart from the above assumptions in this paper someadditional conditions are assumed to hold for that

(A-i) 120578(x119894

120573) is twice continuously differentiable withrespect to 120573 isin 119877

119901(A-ii) 119863 = (120597120578

1

120597120573 120597120578119899

120597120573)119879 is full rank for all 120573

(A-iii) assume 1205790

is the true value of 120579 then there exist aneighbour region 119873

0

of 1205790

and a constant 119888 such that120582min(119869(120579)) gt 119888 for any 120579 isin 119873

0

Under the above assumption following [20] we have

2 119897 (119901

120573119901) minus 119897 (120573

0

1205730

) 997888rarr 1205942

(119901) (14)

where 1205730

is the true value of 120573 and the convergence is underthe meaning of convergence in probability Based on theprofile likelihood theory the confidence region of 120573 with theconfidence level 100(1 minus 120572) is given by

119862 (119884) = 120573 2 [119897 (119901

120573119901) minus 119897 (120573

120573

)] le 1205942

(119901 120572) (15)

where 1205942

(119901 120572) denotes the upper 120572 percentile of chi-squaredistribution with 119901 degrees of freedom Furthermore thehypothesis test

1198670

120573 = 1205730

against 1198671

120573 = 1205730

(16)

can also be considered and the corresponding test statisticsare presented by 2[119897(

120573

) minus 119897(1205730

1205730

)] Unlike standard

likelihood method profile likelihood confidence intervalsand hypothesis test do not need assumption of normality ofthe estimator or distribution which is too restrictive they arebased on an asymptotic chi-square distribution of the log pro-file likelihood ratio test statistics and these properties bring alot of convenience and feasibility in practical computation

5 Simulation Study

To investigate the experimental performance of our method-ology we undertake a simulation study to compare the fittingperformance of misspecified distribution as well as under-standing the large sample properties of the ML estimates Torealize this purpose we generate the artificial data from thefollowing two nonlinear models

Model (1) drug responsiveness model

119910119894

= 1205730

minus1205730

1 + (119909119894

1205731

)1205732

+ 120576119894

(17)

Model (2) nonlinear growth-curve model

119910119894

=1205731

1 + 1205732

exp (minus1205733

119909)+ 120576119894

(18)

with two kinds of skewed distributions for random error asfollows Case (I) 120576 sim STN(120583 120596 120572 V) and Case (II) 120576 sim

FSTN(120583 120596120572 V)The true parameter population is chosen as follows 120573 =

(2 5 minus08) is assumed to be the same for the above twomodels In Model (1) let 120583 = 120575

1

120596 = 15 120572 = 15 and V = 4

and in Model (2) let 120583 = 1205752

120596 = 15 120572 = (1 minus08) and V = 4Note that for the values of the degrees of freedom V a relativevalue (V = 4) can yield a heavy-tailed distribution as we need

In this simulation there are totally four simulated datasets corresponding to two nonlinear models Model (1) andModel (2) along with two distributions Case (I) and Case(II) for random error Similar to previous analysis eachsimulated data set is fitted under STN and FSN as well asFSTN scenarios using three different estimation algorithmsTo shed light on the experimental performance of ourmethodology an interesting comparison can be made byexamining how often we can recognize the true model

Table 1 shows the absolute value of the average biasbetween the true and the estimated parameters and thepercentages of each model being ranked as the best modelbased on AIC criterion out of 500 replications are alsopresented in Table 1

By examination of Table 1 we can find the following(i) the difference for the estimation results between drugresponsiveness model and growth-curve model is significantindicating that the nonlinearity of model imposes an impacton parameter estimation (ii) when the true distribution forrandom error is STN the three fitting distributions haveroughly the same behavior it is hard to tell which approachfor parameter estimation is good and which approach isbad and the similar results can be obtained for modelselection based on AIC (iii) when the true distribution forrandom error is FSTN the FSN and the FSTN distributions

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 6: Likelihood Inference of Nonlinear Models Based on a Class of

6 Abstract and Applied Analysis

for choosing the starting values are implemented as follows

(i) compute the initial value (0)

based on the nonlinearregression model with standard normal assumption

(ii) with (0)

fixed compute the initial values 120596(0) and

120572(0) for the SN and finite mixture SN assumptions

respectively

In order to simplify the estimation of parameter V wehave fixed integral values for V from 2 to 40 by one choose thevalue of V that maximizes the profile likelihood as V(0) andthen the initial values of

(0)

120596(0) 120572(0) V(0) that are requiredin the estimation procedure are all obtained

42 Profile Confidence Estimation and Hypothesis Test Con-fidence interval and hypothesis test play an important role instatistical inference and in the subsequent content we willconsider the profile confidence estimation andhypothesis testfor the parameters of interest in nonlinear regressionmodelsSuppose 119869(120579) = 119864[minus119867(120579)] the following regular conditionsfor likelihood inference are assumed

(R-i) int(120597 log 119871(x 120579)120597120579) sdot 119871(x 120579)119889x = 0

(R-ii) (120597120597120579) int 119871(x 120579) = int(120597119871(x 120579)120597120579)119889x = 0

(R-iii) (120597120597120579) int 119892(x)119871(x 120579)119889x = int 119892(x)(120597119871(x 120579)120597120579)119889xwhere 119892(x) is arbitrary measurable function

(R-iv) 119864(120597 log 119871(x 120579)120597120579)2

gt 0

Apart from the above assumptions in this paper someadditional conditions are assumed to hold for that

(A-i) 120578(x119894

120573) is twice continuously differentiable withrespect to 120573 isin 119877

119901(A-ii) 119863 = (120597120578

1

120597120573 120597120578119899

120597120573)119879 is full rank for all 120573

(A-iii) assume 1205790

is the true value of 120579 then there exist aneighbour region 119873

0

of 1205790

and a constant 119888 such that120582min(119869(120579)) gt 119888 for any 120579 isin 119873

0

Under the above assumption following [20] we have

2 119897 (119901

120573119901) minus 119897 (120573

0

1205730

) 997888rarr 1205942

(119901) (14)

where 1205730

is the true value of 120573 and the convergence is underthe meaning of convergence in probability Based on theprofile likelihood theory the confidence region of 120573 with theconfidence level 100(1 minus 120572) is given by

119862 (119884) = 120573 2 [119897 (119901

120573119901) minus 119897 (120573

120573

)] le 1205942

(119901 120572) (15)

where 1205942

(119901 120572) denotes the upper 120572 percentile of chi-squaredistribution with 119901 degrees of freedom Furthermore thehypothesis test

1198670

120573 = 1205730

against 1198671

120573 = 1205730

(16)

can also be considered and the corresponding test statisticsare presented by 2[119897(

120573

) minus 119897(1205730

1205730

)] Unlike standard

likelihood method profile likelihood confidence intervalsand hypothesis test do not need assumption of normality ofthe estimator or distribution which is too restrictive they arebased on an asymptotic chi-square distribution of the log pro-file likelihood ratio test statistics and these properties bring alot of convenience and feasibility in practical computation

5 Simulation Study

To investigate the experimental performance of our method-ology we undertake a simulation study to compare the fittingperformance of misspecified distribution as well as under-standing the large sample properties of the ML estimates Torealize this purpose we generate the artificial data from thefollowing two nonlinear models

Model (1) drug responsiveness model

119910119894

= 1205730

minus1205730

1 + (119909119894

1205731

)1205732

+ 120576119894

(17)

Model (2) nonlinear growth-curve model

119910119894

=1205731

1 + 1205732

exp (minus1205733

119909)+ 120576119894

(18)

with two kinds of skewed distributions for random error asfollows Case (I) 120576 sim STN(120583 120596 120572 V) and Case (II) 120576 sim

FSTN(120583 120596120572 V)The true parameter population is chosen as follows 120573 =

(2 5 minus08) is assumed to be the same for the above twomodels In Model (1) let 120583 = 120575

1

120596 = 15 120572 = 15 and V = 4

and in Model (2) let 120583 = 1205752

120596 = 15 120572 = (1 minus08) and V = 4Note that for the values of the degrees of freedom V a relativevalue (V = 4) can yield a heavy-tailed distribution as we need

In this simulation there are totally four simulated datasets corresponding to two nonlinear models Model (1) andModel (2) along with two distributions Case (I) and Case(II) for random error Similar to previous analysis eachsimulated data set is fitted under STN and FSN as well asFSTN scenarios using three different estimation algorithmsTo shed light on the experimental performance of ourmethodology an interesting comparison can be made byexamining how often we can recognize the true model

Table 1 shows the absolute value of the average biasbetween the true and the estimated parameters and thepercentages of each model being ranked as the best modelbased on AIC criterion out of 500 replications are alsopresented in Table 1

By examination of Table 1 we can find the following(i) the difference for the estimation results between drugresponsiveness model and growth-curve model is significantindicating that the nonlinearity of model imposes an impacton parameter estimation (ii) when the true distribution forrandom error is STN the three fitting distributions haveroughly the same behavior it is hard to tell which approachfor parameter estimation is good and which approach isbad and the similar results can be obtained for modelselection based on AIC (iii) when the true distribution forrandom error is FSTN the FSN and the FSTN distributions

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 7: Likelihood Inference of Nonlinear Models Based on a Class of

Abstract and Applied Analysis 7

Table 1 Results of the simulation study

Method Model Error Fittingmodel

Bias and perc1205730

1205731

1205732

()

NR

1

Case ISTN 0048 0044 0034 384

FSN 0052 0043 0031 302

FSTN 0051 0048 0035 314

Case IISTN 0941 0445 0374 105

FSN 0035 0014 0011 438

FSTN 0033 0010 0010 457

2

Case ISTN 0067 0061 0053 362

FSN 0070 0057 0055 302

FSTN 0073 0072 0048 336

Case IISTN 0963 0469 0391 85

FSN 0047 0023 0017 446

FSTN 0043 0022 0021 469

P-NR

1

Case ISTN 0049 0040 0032 326

FSN 0050 0047 0038 401

FSTN 0052 0049 0036 273

Case IISTN 1029 0420 0248 114

FSN 0039 0018 0010 396

FSTN 0038 0017 0009 490

2

Case ISTN 0070 0050 0047 303

FSN 0067 0057 0056 423

FSTN 0073 0071 0050 274

Case IISTN 1052 0446 0265 102

FSN 0053 0031 0020 396

FSTN 0052 0022 0017 502

MP-NR

1

Case ISTN 0045 0049 0033 323FSN 0051 0047 0036 383FSTN 0045 0044 0031 294

Case IISTN 0773 0373 0266 145FSN 0031 0018 0011 384FSTN 0028 0010 0008 471

2

Case ISTN 0068 0072 0062 281FSN 0076 0074 0062 412FSTN 0073 0060 0054 307

Case IISTN 0794 0394 0290 113FSN 0037 0026 0017 394FSTN 0036 0021 0013 493

outperform the STNdistribution by producing estimateswithlower bias and higher AIC proportion furthermore the PNRand MPNR methods perform better than the traditional NRmethod in general

To study the consistence properties of ML estimate wefocus on the situation that the true distribution for randomerror is Case (II) whereas the fitting distribution is FSTN tooFor this case two estimation algorithms PNR and MPNR

are adopted for parameter estimation and samples of differentsizes (119899 = 50 100 and 200) are generated from Models (1)

and (2) We compute the 95 confidence interval for theparameters of interest and the mean square error (MSE) fordifferent model where MSE(120573) = (1500) sum

500

119894=1

120573(119894)

minus 1205732

The length of the 95 confidence interval and theMSE resultsare summarized in Table 2 From Table 2 we can see thatthe length of the confidence interval and the MSE tend todecrease as the sample size increases as expected

Tables 1 and 2 show that in general FSTN distributionenjoys more robustness and flexibility in modeling data withskewness and heavy tails as well asmultimodality in compari-sonwith other skewed alternatives and the implementation ofMPNR method brings more accuracy and improvement formodel estimation in the context of nonlinear regression withthis new distribution

6 Conclusion

We have proposed a new skewed distribution based on thegeneral FSS distribution framework called the FSTN dis-tribution which is allowed to accommodate multimodalityasymmetry and heavy tails jointly to offer greater flexibilitythan SN and STN counterpartsMoreover we have developeda modified profile of Newton-Raphson iterative algorithmfor estimating the parameters of interest of nonlinear modelwith FSTN distribution and the interval estimation andhypothesis test in a profile likelihood paradigm are alsoconsidered

Numerical studies reveal that in the context of nonlinearregression analysis if the true distribution is STN or FSNwhereas the fitting distribution is FSTN the estimationresults are not influenced by this misspecification of distribu-tion assumptionHowever once the true distribution is FSTNwhile the fitting distribution is STN the estimation resultsappear to be somewhat disappointing which shows therobustness of FSTN distribution In general the combinationof FSTN distribution with MPNR method brings moreaccuracy and improvement on the estimation of nonlinearregression

So far the present methodology is limited to the completedata analysis the extensions of this paper include missingdata version as well as Bayesian analysis of this model whichwill be reported in another paper

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was supported by the National Natural ScienceFoundation of China under Grants 11171105 and 61273021 andin part by the Natural Science Foundation Project of CQcstc2013jjB40008

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 8: Likelihood Inference of Nonlinear Models Based on a Class of

8 Abstract and Applied Analysis

Table 2 The table shows the length of 95 confidence interval of parameters of interest and MSE of specified model

Model Sample size Profile-NR Modified profile-NR1205730

1205731

1205732

MSE 1205730

1205731

1205732

MSE

150 1015 0675 0374 0195 0959 0589 0341 0167100 0558 0365 0152 0051 0519 0254 0132 0073200 0172 0134 0076 0004 0120 0141 0081 0005

250 1258 0828 0449 0240 1152 0692 0390 0154100 0569 0448 0208 0099 0516 0427 0211 0022200 0232 0181 0122 0009 0214 0145 0120 0008

References

[1] G Verbeke and E Lesaffre ldquoA linear mixed-effects model withheterogeneity in the random-effects populationrdquo Journal of theAmerican Statistical Association vol 91 no 433 pp 217ndash2211996

[2] P GhoshMD Branco andH Chakraborty ldquoBivariate randomeffectmodel using skew-normal distributionwith application toHIV-RNArdquo Statistics in Medicine vol 26 no 6 pp 1255ndash12672007

[3] A Azzalini and M G Genton ldquoRobust likelihood methodsbased on the skew-t and related distributionsrdquo InternationalStatistical Review vol 76 no 1 pp 106ndash129 2008

[4] A Azzalini ldquoA class of distributions which includes the normalonesrdquo Scandinavian Journal of Statistics vol 12 no 2 pp 171ndash178 1985

[5] A Azzalini and A Capitanio ldquoDistributions generated byperturbation of symmetrywith emphasis on amultivariate skew119905-distributionrdquo Journal of the Royal Statistical Society Series BStatistical Methodology vol 65 no 2 pp 367ndash389 2003

[6] H W Gomez O Venegas and H Bolfarine ldquoSkew-symmetricdistributions generated by the distribution function of thenormal distributionrdquo Environmetrics vol 18 no 4 pp 395ndash4072007

[7] A Azzalini and A Dalla Valle ldquoThe multivariate skew-normaldistributionrdquo Biometrika vol 83 no 4 pp 715ndash726 1996

[8] A Azzalini and A Capitanio ldquoStatistical applications of themultivariate skew normal distributionrdquo Journal of the RoyalStatistical Society B Statistical Methodology vol 61 no 3 pp579ndash602 1999

[9] Y Ma and M G Genton ldquoFlexible class of skew-symmetricdistributionsrdquo Scandinavian Journal of Statistics vol 31 no 3pp 459ndash468 2004

[10] A Azzalini ldquoThe skew-normal distribution and related multi-variate familiesrdquo Scandinavian Journal of Statistics vol 32 no2 pp 159ndash200 2005

[11] H J Ho S Pyne and T I Lin ldquoMaximum likelihood inferencefor mixtures of skew Student-119905-normal distributions throughpractical EM-type algorithmsrdquo Statistics andComputing vol 22no 1 pp 287ndash299 2012

[12] C B Zeller VH Lachos and F E Vilca-Labra ldquoLocal influenceanalysis for regression models with scale mixtures of skew-normal distributionsrdquo Journal of Applied Statistics vol 38 no2 pp 343ndash368 2011

[13] V G Cancho D K Dey V H Lachos and M G AndradeldquoBayesian nonlinear regression models with scale mixturesof skew-normal distributions estimation and case influencediagnosticsrdquo Computational Statistics amp Data Analysis vol 55no 1 pp 588ndash602 2011

[14] M D Branco and D K Dey ldquoA general class of multivariateskew-elliptical distributionsrdquo Journal of Multivariate Analysisvol 79 no 1 pp 99ndash113 2001

[15] F C Xie J G Lin and B CWei ldquoDiagnostics for skew-normalnonlinear regression models with AR(1) errorsrdquo ComputationalStatistics and Data Analysis vol 53 no 12 pp 4403ndash4416 2009

[16] F-C Xie B-C Wei and J-G Lin ldquoHomogeneity diagnosticsfor skew-normal nonlinear regressions modelsrdquo Statistics ampProbability Letters vol 79 no 6 pp 821ndash827 2009

[17] L H Vanegas and F J Cysneiros ldquoAssessment of diagnosticprocedures in symmetrical nonlinear regression modelsrdquo Com-putational Statistics amp Data Analysis vol 54 no 4 pp 1002ndash1016 2010

[18] T A Severini ldquoAn approximation to the modified profilelikelihood functionrdquo Biometrika vol 85 no 2 pp 403ndash4111998

[19] G K Smyth ldquoPartitioned algorithms for maximum likelihoodand other non-linear estimationrdquo Statistics and Computing vol6 no 3 pp 201ndash216 1996

[20] B Wei Exponential Family Nonlinear Models Springer Singa-pore 1998

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 9: Likelihood Inference of Nonlinear Models Based on a Class of

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of