Presentation is loading. Please wait.

Presentation is loading. Please wait.

CD-HPF:New Habitability Score Via Data Analytic Modeling

Similar presentations


Presentation on theme: "CD-HPF:New Habitability Score Via Data Analytic Modeling"— Presentation transcript:

1 CD-HPF:New Habitability Score Via Data Analytic Modeling
Snehanshu Saha Professor., Computer Science Dept.,PESIT-BSC, Bangalore.

2 INTRODUCTION At present, only known habitable planet is Earth.
2 important questions to be answered – Can we have life’s existence exactly similar to Earth , somewhere else? Or Do we have life in unknown form existing somewhere else?

3 Introduction.. Astronomers uses mainly 2 important parameters to answer these 2 questions- Earth’s Similarity Index (ESI)- based on 4 parameters radius, surface temperature, density and escape velocity and is given as-

4 Introduction.. Planetary Habitability Index(PHI) which is basically to estimate 2nd question and is given as- Where S is Substrate, E is Energy, C is Chemistry of compounds and L is liquid solvent.

5 Introduction.. We are proposing an approach that makes use of Cobb Douglas Production Function to obtain the new habitability score for exoplanets.

6 Cobb Douglas Production Function (CDPF)
In general, CDPF is given as- Where Y is the production function and α and β are elasticity constants which are some positive fractions , k is a positive constant and x1 & x2 are input parameters. NOTE: Function can be extended for any number of inputs.

7 Elasticity Constants in CDPF
Sum of α & β helps us in deducing the important results about the function , If their sum is 1, function is homogeneous of degree 1, and is called as Constant returns to Scale(CRS), increase in 1 input gives increase in output in same proportion. If their sum is less than 1, it is decreasing returns to scale (DRS), where diminishing returns will set in.

8 Elasticity Constants in CDPF..
If the sum is more than 1, it is increasing returns to scale(IRS), here output increases with variable factors.

9 Use of CDPF to estimate CDHS
We have formulated CDPF to estimate Cobb Douglas Habitability Score(CDHS) for exoplanets. First, we have calculated interior and surface CDHS for each exoplanet and then used a convex combination of the two to compute final CDHS. While doing this we found and proved that function is maximized for CRS & DRS case.

10 Use of CDPF to estimate CDHS
We specify CDPF as- where above parameters are radius, density , surface temperature & escape velocity with their elasticity constraints.

11 Use of CDPF to estimate CDHS

12 Use of CDPF to estimate CDHS
Set criteria is to choose α,β,γ,δ so as to maximize Y. For CRS, where all the elasticities of different cost components are equal,Y can be specified as- and In such cases, Y is geometric mean of all inputs.

13 Data set For this work , we used HABCAT database, available at From this database we selected 664 confirmed exoplanets for which surface temperature was known.

14 Our Results CDPF was applied to get CDHS based on radius & density , we called it as CDHSi and another CDHS based on escape velocity and surface temperature and we called it as CDHSs. We got data in the range from to for CDHSi and from to for CDHSs.

15 Graphs for CDHSi & CDHSs
CDHSi Here α & β values are 0.8 & 0.1 respectively.

16 Graphs for CDHSi & CDHSs
CDHSs Here α & β values are 0.8 & 0.1 respectively.

17 CDHS Calculation Next we calculated final CDHS as-
where w’ was considered as 0.99 & w” as 0.01, because w’+w” must be equal to 1. Values obtained for CDHS were in the range from to

18 Classification based on CDHS
After calculating CDHS , we applied KNN classification algorithm to check the number of planets belonging to Earth’s Class. According to CDHS result , 5 classes are considered with k =7, where Earth’s class was class 4th. These classes have following ranges according to classification done-

19 KNN Algorithm Consider k as the desired number of nearest neighbors and S:=p1,...,pn be the set of training samples in the form pi=(xi,ci), where xi is the d-dimensional feature vector (1 in our case) of the point pi and ci is the class that pi belongs to. Let T:=p1’,p2’,…..pr’ be the testing samples. for each p’=(x’,c’) in T { compute the distance d(x’,xi) between p’ & all pi belonging to S; Sort pi according to d(x’,xi); Select k closest training samples to p’ from the sorted list; Assign a class to p’ based on majority vote ; }

20 Class Ranges According to CDHS
Class1 - From 1.98 to Class2 - Less than 1.98 to greater than 1.680 Class3 - From 1.68 to greater than 1.443 Class4 - From to greater than 1.23 Class5 - From 1.23 to greater than equal to

21 Result of Classification
80% of data is used as training and 20% data for testing. In Class5, we obtained 13 exoplanets. Result plot of KNN Accuracy obtained is 92.5%.

22 High Level Dataflow Planets highly likely to be habitable are- GJ163c
Employ KNN (preliminary classification) No. of classes Load the dataset Cobb Douglas Engine Attribute Filtering Computed CDHPF Reorganized Classes Final 5 classes Class 5 contains Earth & other habitable planets Planets highly likely to be habitable are- GJ163c GJ667C c GJ832c HD40307g Kepler-186f Kepler-62e Kepler-62f etc.

23 Acknowledgement I sincerely thank Ms. Kakoli Bora and Ms. Surbhi Agrawal, my Ph.D. students for sincere efforts. I acknowledge Dr. Margarita Safanova , Indian Institute of Astrophysics , who has been an excellent collaborator in this project.

24 THANK YOU


Download ppt "CD-HPF:New Habitability Score Via Data Analytic Modeling"

Similar presentations


Ads by Google