{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Day 6, Part 1 - Data processing, Info Viz\n", "While we'll focus a lot on scientific viz of your sims, we can also take this opportunity to do some \"info viz\", which can provide context for your sci viz. Info viz is an important skill to have as well!\n", "\n", "Let's take a step back and look at a larger planet dataset to see how we might process a large table of data. In this example, we'll use more Kepler data.\n", "\n", "While filtering is only sometimes important for simulated data, it is generally necessary for observational data, and for comparing our simulations to observations.\n", "\n", "We'll use the \"pandas\" package to do this, which is a useful thing to know how to use anyway!" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "# import our usual stuff\n", "%matplotlib inline\n", "import matplotlib\n", "import matplotlib.pyplot as plt\n", "import numpy as np" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "# now, import pandas\n", "import pandas as pd" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [], "source": [ "# now let's read in the Kepler confirmed planets dataset\n", "planets = pd.read_csv('https://jnaiman.github.io/csci-p-14110/lesson06/data/planets_2019.07.12_17.16.25.csv', \n", " sep=\",\", comment=\"#\")\n", "# note: feel free to download this file and read it from your local copy instead" ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "

4016 rows × 88 columns

\n", "
" ], "text/plain": [ " pl_hostname pl_letter pl_name \\\n", "0 11 Com b 11 Com b \n", "1 11 UMi b 11 UMi b \n", "2 14 And b 14 And b \n", "3 14 Her b 14 Her b \n", "4 16 Cyg B b 16 Cyg B b \n", "5 18 Del b 18 Del b \n", "6 1RXS J160929.1-210524 b 1RXS J160929.1-210524 b \n", "7 24 Boo b 24 Boo b \n", "8 24 Sex b 24 Sex b \n", "9 24 Sex c 24 Sex c \n", "10 2MASS J01225093-2439505 b 2MASS J01225093-2439505 b \n", "11 2MASS J02192210-3925225 b 2MASS J02192210-3925225 b \n", "12 2MASS J04414489+2301513 b 2MASS J04414489+2301513 b \n", "13 2MASS J12073346-3932539 b 2MASS J12073346-3932539 b \n", "14 2MASS J19383260+4603591 b 2MASS J19383260+4603591 b \n", "15 2MASS J21402931+1625183 A b 2MASS J21402931+1625183 A b \n", "16 2MASS J22362452+4751425 b 2MASS J22362452+4751425 b \n", "17 30 Ari B b 30 Ari B b \n", "18 4 UMa b 4 UMa b \n", "19 42 Dra b 42 Dra b \n", "20 47 UMa b 47 UMa b \n", "21 47 UMa c 47 UMa c \n", "22 47 UMa d 47 UMa d \n", "23 51 Eri b 51 Eri b \n", "24 51 Peg b 51 Peg b \n", "25 55 Cnc b 55 Cnc b \n", "26 55 Cnc c 55 Cnc c \n", "27 55 Cnc d 55 Cnc d \n", "28 55 Cnc e 55 Cnc e \n", "29 55 Cnc f 55 Cnc f \n", "... ... ... ... 
\n", "3986 eps CrB b eps CrB b \n", "3987 eps Eri b eps Eri b \n", "3988 eps Tau b eps Tau b \n", "3989 gam 1 Leo b gam 1 Leo b \n", "3990 gam Cep b gam Cep b \n", "3991 gam Lib b gam Lib b \n", "3992 gam Lib c gam Lib c \n", "3993 iot Dra b iot Dra b \n", "3994 kap And b kap And b \n", "3995 kap CrB b kap CrB b \n", "3996 mu Leo b mu Leo b \n", "3997 nu Oph b nu Oph b \n", "3998 nu Oph c nu Oph c \n", "3999 ome Ser b ome Ser b \n", "4000 omi CrB b omi CrB b \n", "4001 omi UMa b omi UMa b \n", "4002 HD 39091 c pi Men c \n", "4003 psi 1 Dra B b psi 1 Dra B b \n", "4004 rho CrB b rho CrB b \n", "4005 rho CrB c rho CrB c \n", "4006 tau Boo b tau Boo b \n", "4007 tau Cet e tau Cet e \n", "4008 tau Cet f tau Cet f \n", "4009 tau Cet g tau Cet g \n", "4010 tau Cet h tau Cet h \n", "4011 tau Gem b tau Gem b \n", "4012 ups And b ups And b \n", "4013 ups And c ups And c \n", "4014 ups And d ups And d \n", "4015 xi Aql b xi Aql b \n", "\n", " pl_discmethod pl_controvflag pl_pnum pl_orbper \\\n", "0 Radial Velocity 0 1 326.030000 \n", "1 Radial Velocity 0 1 516.219970 \n", "2 Radial Velocity 0 1 185.840000 \n", "3 Radial Velocity 0 1 1773.400020 \n", "4 Radial Velocity 0 1 798.500000 \n", "5 Radial Velocity 0 1 993.300000 \n", "6 Imaging 0 1 NaN \n", "7 Radial Velocity 0 1 30.350600 \n", "8 Radial Velocity 0 2 452.800000 \n", "9 Radial Velocity 0 2 883.000000 \n", "10 Imaging 0 1 NaN \n", "11 Imaging 0 1 NaN \n", "12 Imaging 0 1 NaN \n", "13 Imaging 0 1 NaN \n", "14 Eclipse Timing Variations 0 1 416.000000 \n", "15 Imaging 0 1 7336.500000 \n", "16 Imaging 0 1 NaN \n", "17 Radial Velocity 0 1 335.100010 \n", "18 Radial Velocity 0 1 269.300000 \n", "19 Radial Velocity 0 1 479.100000 \n", "20 Radial Velocity 0 3 1078.000000 \n", "21 Radial Velocity 0 3 2391.000000 \n", "22 Radial Velocity 0 3 14002.000000 \n", "23 Imaging 0 1 11688.000000 \n", "24 Radial Velocity 0 1 4.230785 \n", "25 Radial Velocity 0 5 14.651520 \n", "26 Radial Velocity 0 5 44.417500 \n", "27 Radial Velocity 0 
5 4825.000000 \n", "28 Radial Velocity 0 5 0.736539 \n", "29 Radial Velocity 0 5 262.000000 \n", "... ... ... ... ... \n", "3986 Radial Velocity 0 1 417.900000 \n", "3987 Radial Velocity 0 1 2502.000000 \n", "3988 Radial Velocity 0 1 594.900000 \n", "3989 Radial Velocity 0 1 428.500000 \n", "3990 Radial Velocity 0 1 903.300000 \n", "3991 Radial Velocity 0 2 415.200000 \n", "3992 Radial Velocity 0 2 964.600000 \n", "3993 Radial Velocity 0 1 511.098000 \n", "3994 Imaging 0 1 NaN \n", "3995 Radial Velocity 0 1 1285.000000 \n", "3996 Radial Velocity 0 1 357.800000 \n", "3997 Radial Velocity 0 2 530.320000 \n", "3998 Radial Velocity 0 2 3186.000000 \n", "3999 Radial Velocity 0 1 277.020000 \n", "4000 Radial Velocity 0 1 187.830000 \n", "4001 Radial Velocity 0 1 1630.000000 \n", "4002 Transit 0 2 6.267900 \n", "4003 Radial Velocity 0 1 3117.000000 \n", "4004 Radial Velocity 1 2 39.845800 \n", "4005 Radial Velocity 0 2 102.540000 \n", "4006 Radial Velocity 0 1 3.312457 \n", "4007 Radial Velocity 0 4 162.870000 \n", "4008 Radial Velocity 0 4 636.130000 \n", "4009 Radial Velocity 0 4 20.000000 \n", "4010 Radial Velocity 0 4 49.410000 \n", "4011 Radial Velocity 0 1 305.500000 \n", "4012 Radial Velocity 0 3 4.617033 \n", "4013 Radial Velocity 0 3 241.258000 \n", "4014 Radial Velocity 0 3 1276.460000 \n", "4015 Radial Velocity 0 1 136.750000 \n", "\n", " pl_orbpererr1 pl_orbpererr2 pl_orbperlim ... st_sperr st_splim \\\n", "0 0.320000 -0.320000 0.0 ... NaN 0.0 \n", "1 3.200000 -3.200000 0.0 ... NaN 0.0 \n", "2 0.230000 -0.230000 0.0 ... NaN 0.0 \n", "3 2.500000 -2.500000 0.0 ... NaN 0.0 \n", "4 1.000000 -1.000000 0.0 ... NaN 0.0 \n", "5 3.200000 -3.200000 0.0 ... NaN 0.0 \n", "6 NaN NaN NaN ... NaN 0.0 \n", "7 0.007800 -0.007700 0.0 ... NaN 0.0 \n", "8 2.100000 -4.500000 0.0 ... NaN 0.0 \n", "9 32.400000 -13.800000 0.0 ... NaN 0.0 \n", "10 NaN NaN NaN ... NaN 0.0 \n", "11 NaN NaN NaN ... NaN 0.0 \n", "12 NaN NaN NaN ... NaN 0.0 \n", "13 NaN NaN NaN ... 
NaN 0.0 \n", "14 2.000000 -2.000000 0.0 ... NaN NaN \n", "15 1934.500000 -584.000000 0.0 ... NaN 0.0 \n", "16 NaN NaN NaN ... NaN 0.0 \n", "17 2.500000 -2.500000 0.0 ... NaN 0.0 \n", "18 1.960000 -1.960000 0.0 ... NaN 0.0 \n", "19 6.200000 -6.200000 0.0 ... NaN 0.0 \n", "20 2.000000 -2.000000 0.0 ... NaN 0.0 \n", "21 100.000000 -87.000000 0.0 ... NaN 0.0 \n", "22 4018.000000 -5095.000000 0.0 ... NaN 0.0 \n", "23 6209.250000 -3287.250000 0.0 ... NaN 0.0 \n", "24 0.000036 -0.000036 0.0 ... NaN 0.0 \n", "25 0.000150 -0.000150 0.0 ... NaN 0.0 \n", "26 0.007300 -0.007300 0.0 ... NaN 0.0 \n", "27 39.000000 -39.000000 0.0 ... NaN 0.0 \n", "28 0.000007 -0.000007 0.0 ... NaN 0.0 \n", "29 0.510000 -0.510000 0.0 ... NaN 0.0 \n", "... ... ... ... ... ... ... \n", "3986 0.500000 -0.500000 0.0 ... NaN 0.0 \n", "3987 10.000000 -10.000000 0.0 ... NaN 0.0 \n", "3988 5.300000 -5.300000 0.0 ... NaN 0.0 \n", "3989 1.250000 -1.250000 0.0 ... NaN 0.0 \n", "3990 1.500000 -1.500000 0.0 ... NaN 0.0 \n", "3991 1.800000 -1.900000 0.0 ... NaN 0.0 \n", "3992 3.100000 -3.100000 0.0 ... NaN 0.0 \n", "3993 0.089000 -0.089000 0.0 ... NaN 0.0 \n", "3994 NaN NaN NaN ... NaN 0.0 \n", "3995 14.000000 -14.000000 0.0 ... NaN 0.0 \n", "3996 1.200000 -1.200000 0.0 ... NaN 0.0 \n", "3997 0.350000 -0.350000 0.0 ... NaN 0.0 \n", "3998 14.000000 -14.000000 0.0 ... NaN 0.0 \n", "3999 0.520000 -0.510000 0.0 ... NaN 0.0 \n", "4000 0.540000 -0.540000 0.0 ... NaN 0.0 \n", "4001 35.000000 -35.000000 0.0 ... NaN 0.0 \n", "4002 0.000460 -0.000460 0.0 ... NaN 0.0 \n", "4003 42.000000 -42.000000 0.0 ... NaN 0.0 \n", "4004 0.001500 -0.001400 0.0 ... NaN 0.0 \n", "4005 0.170000 -0.170000 0.0 ... NaN 0.0 \n", "4006 0.000007 -0.000007 0.0 ... NaN 0.0 \n", "4007 1.080000 -0.460000 0.0 ... NaN 0.0 \n", "4008 11.700000 -47.690000 0.0 ... NaN 0.0 \n", "4009 0.020000 -0.010000 0.0 ... NaN 0.0 \n", "4010 0.080000 -0.100000 0.0 ... NaN 0.0 \n", "4011 0.100000 -0.100000 0.0 ... NaN 0.0 \n", "4012 0.000023 -0.000023 0.0 ... 
NaN 0.0 \n", "4013 0.064000 -0.064000 0.0 ... NaN 0.0 \n", "4014 0.570000 -0.570000 0.0 ... NaN 0.0 \n", "4015 0.250000 -0.250000 0.0 ... NaN 0.0 \n", "\n", " st_lum st_lumerr1 st_lumerr2 st_lumlim st_age st_ageerr1 \\\n", "0 2.243 0.071 -0.085 0.0 NaN NaN \n", "1 NaN NaN NaN 0.0 NaN NaN \n", "2 1.763 NaN NaN 0.0 NaN NaN \n", "3 NaN NaN NaN 0.0 NaN NaN \n", "4 NaN NaN NaN 0.0 NaN NaN \n", "5 1.602 NaN NaN 0.0 NaN NaN \n", "6 -0.370 0.150 -0.150 0.0 0.005 NaN \n", "7 1.774 0.047 -0.053 0.0 6.920 4.830 \n", "8 1.164 0.003 -0.003 0.0 2.700 0.400 \n", "9 1.164 0.003 -0.003 0.0 2.700 0.400 \n", "10 -1.720 0.110 -0.110 0.0 0.120 0.010 \n", "11 -2.230 0.060 -0.060 0.0 0.035 0.005 \n", "12 NaN NaN NaN NaN 0.001 NaN \n", "13 NaN NaN NaN NaN 0.008 0.004 \n", "14 NaN NaN NaN NaN NaN NaN \n", "15 -3.480 0.350 -0.350 0.0 NaN NaN \n", "16 NaN NaN NaN NaN 0.120 0.010 \n", "17 NaN NaN NaN 0.0 NaN NaN \n", "18 NaN NaN NaN NaN 4.604 2.000 \n", "19 NaN NaN NaN NaN 9.490 1.760 \n", "20 NaN NaN NaN NaN 7.300 1.900 \n", "21 NaN NaN NaN NaN 7.300 1.900 \n", "22 NaN NaN NaN NaN 7.300 1.900 \n", "23 0.850 0.060 -0.060 0.0 0.020 0.006 \n", "24 NaN NaN NaN NaN 4.000 2.500 \n", "25 -0.235 0.010 -0.011 0.0 10.200 2.500 \n", "26 -0.235 0.010 -0.011 0.0 10.200 2.500 \n", "27 -0.235 0.010 -0.011 0.0 10.200 2.500 \n", "28 -0.235 0.010 -0.011 0.0 10.200 2.500 \n", "29 -0.235 0.010 -0.011 0.0 10.200 2.500 \n", "... ... ... ... ... ... ... 
\n", "3986 NaN NaN NaN 0.0 1.740 0.370 \n", "3987 NaN NaN NaN NaN 0.800 NaN \n", "3988 1.987 0.037 NaN 0.0 NaN NaN \n", "3989 NaN NaN NaN NaN NaN NaN \n", "3990 NaN NaN NaN NaN NaN NaN \n", "3991 1.850 0.035 -0.038 0.0 2.820 1.970 \n", "3992 1.850 0.035 -0.038 0.0 2.820 1.970 \n", "3993 NaN NaN NaN NaN NaN NaN \n", "3994 1.830 0.040 -0.040 0.0 0.030 0.120 \n", "3995 NaN NaN NaN 0.0 NaN NaN \n", "3996 1.797 NaN NaN 0.0 3.350 0.700 \n", "3997 2.090 NaN NaN 0.0 NaN NaN \n", "3998 2.090 NaN NaN 0.0 NaN NaN \n", "3999 1.850 NaN NaN 0.0 NaN NaN \n", "4000 1.709 NaN NaN 0.0 NaN NaN \n", "4001 2.140 NaN NaN 0.0 NaN NaN \n", "4002 0.160 0.006 -0.006 0.0 2.980 1.400 \n", "4003 NaN NaN NaN NaN 3.300 1.000 \n", "4004 0.232 0.011 -0.011 0.0 NaN NaN \n", "4005 0.232 0.011 -0.011 0.0 NaN NaN \n", "4006 NaN NaN NaN NaN NaN NaN \n", "4007 NaN NaN NaN 0.0 NaN NaN \n", "4008 NaN NaN NaN 0.0 NaN NaN \n", "4009 NaN NaN NaN 0.0 NaN NaN \n", "4010 NaN NaN NaN 0.0 NaN NaN \n", "4011 NaN NaN NaN NaN 1.220 0.760 \n", "4012 NaN NaN NaN NaN 5.000 NaN \n", "4013 NaN NaN NaN NaN 5.000 NaN \n", "4014 NaN NaN NaN NaN 5.000 NaN \n", "4015 1.839 NaN NaN 0.0 NaN NaN \n", "\n", " st_ageerr2 st_agelim \n", "0 NaN NaN \n", "1 NaN 0.0 \n", "2 NaN NaN \n", "3 NaN 0.0 \n", "4 NaN 0.0 \n", "5 NaN NaN \n", "6 NaN NaN \n", "7 -2.750 0.0 \n", "8 -0.400 NaN \n", "9 -0.400 NaN \n", "10 -0.010 NaN \n", "11 -0.005 NaN \n", "12 NaN NaN \n", "13 -0.003 NaN \n", "14 NaN NaN \n", "15 NaN NaN \n", "16 -0.010 NaN \n", "17 NaN 0.0 \n", "18 -2.000 NaN \n", "19 -1.760 NaN \n", "20 -1.900 NaN \n", "21 -1.900 NaN \n", "22 -1.900 NaN \n", "23 -0.006 NaN \n", "24 -2.500 NaN \n", "25 -2.500 NaN \n", "26 -2.500 NaN \n", "27 -2.500 NaN \n", "28 -2.500 NaN \n", "29 -2.500 NaN \n", "... ... ... 
\n", "3986 -0.370 NaN \n", "3987 NaN NaN \n", "3988 NaN NaN \n", "3989 NaN NaN \n", "3990 NaN NaN \n", "3991 -0.820 0.0 \n", "3992 -0.820 0.0 \n", "3993 NaN NaN \n", "3994 -0.010 NaN \n", "3995 NaN 0.0 \n", "3996 -0.700 NaN \n", "3997 NaN NaN \n", "3998 NaN NaN \n", "3999 NaN NaN \n", "4000 NaN NaN \n", "4001 NaN NaN \n", "4002 -1.300 0.0 \n", "4003 -1.000 NaN \n", "4004 NaN NaN \n", "4005 NaN NaN \n", "4006 NaN NaN \n", "4007 NaN NaN \n", "4008 NaN NaN \n", "4009 NaN NaN \n", "4010 NaN NaN \n", "4011 -0.760 NaN \n", "4012 NaN NaN \n", "4013 NaN NaN \n", "4014 NaN NaN \n", "4015 NaN NaN \n", "\n", "[4016 rows x 88 columns]" ] }, "execution_count": 4, "metadata": {}, "output_type": "execute_result" } ], "source": [ "planets\n", "# formatting here is sort of nice" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [], "source": [ "# skip this\n", "# how many entries are there? as an iterable\n", "#planets.index" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "
\n", "

4 rows × 88 columns

\n", "
" ], "text/plain": [ " pl_hostname pl_letter pl_name pl_discmethod pl_controvflag pl_pnum \\\n", "0 11 Com b 11 Com b Radial Velocity 0 1 \n", "1 11 UMi b 11 UMi b Radial Velocity 0 1 \n", "2 14 And b 14 And b Radial Velocity 0 1 \n", "3 14 Her b 14 Her b Radial Velocity 0 1 \n", "\n", " pl_orbper pl_orbpererr1 pl_orbpererr2 pl_orbperlim ... st_sperr \\\n", "0 326.03000 0.32 -0.32 0.0 ... NaN \n", "1 516.21997 3.20 -3.20 0.0 ... NaN \n", "2 185.84000 0.23 -0.23 0.0 ... NaN \n", "3 1773.40002 2.50 -2.50 0.0 ... NaN \n", "\n", " st_splim st_lum st_lumerr1 st_lumerr2 st_lumlim st_age st_ageerr1 \\\n", "0 0.0 2.243 0.071 -0.085 0.0 NaN NaN \n", "1 0.0 NaN NaN NaN 0.0 NaN NaN \n", "2 0.0 1.763 NaN NaN 0.0 NaN NaN \n", "3 0.0 NaN NaN NaN 0.0 NaN NaN \n", "\n", " st_ageerr2 st_agelim \n", "0 NaN NaN \n", "1 NaN 0.0 \n", "2 NaN NaN \n", "3 NaN 0.0 \n", "\n", "[4 rows x 88 columns]" ] }, "execution_count": 6, "metadata": {}, "output_type": "execute_result" } ], "source": [ "planets.loc[0:3] #easy to grab subsets - here by label\n", "#planets.loc? 
#easy to grab subsets - here by label" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "Index(['pl_hostname', 'pl_letter', 'pl_name', 'pl_discmethod',\n", " 'pl_controvflag', 'pl_pnum', 'pl_orbper', 'pl_orbpererr1',\n", " 'pl_orbpererr2', 'pl_orbperlim', 'pl_orbsmax', 'pl_orbsmaxerr1',\n", " 'pl_orbsmaxerr2', 'pl_orbsmaxlim', 'pl_orbeccen', 'pl_orbeccenerr1',\n", " 'pl_orbeccenerr2', 'pl_orbeccenlim', 'pl_orbincl', 'pl_orbinclerr1',\n", " 'pl_orbinclerr2', 'pl_orbincllim', 'pl_bmassj', 'pl_bmassjerr1',\n", " 'pl_bmassjerr2', 'pl_bmassjlim', 'pl_bmassprov', 'pl_radj',\n", " 'pl_radjerr1', 'pl_radjerr2', 'pl_radjlim', 'pl_dens', 'pl_denserr1',\n", " 'pl_denserr2', 'pl_denslim', 'ra_str', 'ra', 'dec_str', 'dec',\n", " 'st_dist', 'st_disterr1', 'st_disterr2', 'st_distlim', 'gaia_dist',\n", " 'gaia_disterr1', 'gaia_disterr2', 'gaia_distlim', 'st_optmag',\n", " 'st_optmagerr', 'st_optmaglim', 'st_optband', 'gaia_gmag',\n", " 'gaia_gmagerr', 'gaia_gmaglim', 'st_teff', 'st_tefferr1', 'st_tefferr2',\n", " 'st_tefflim', 'st_mass', 'st_masserr1', 'st_masserr2', 'st_masslim',\n", " 'st_rad', 'st_raderr1', 'st_raderr2', 'st_radlim', 'pl_massj',\n", " 'pl_massjerr1', 'pl_massjerr2', 'pl_massjlim', 'pl_rade', 'pl_radeerr1',\n", " 'pl_radeerr2', 'pl_radelim', 'pl_disc', 'pl_mnum', 'st_sp', 'st_spstr',\n", " 'st_sperr', 'st_splim', 'st_lum', 'st_lumerr1', 'st_lumerr2',\n", " 'st_lumlim', 'st_age', 'st_ageerr1', 'st_ageerr2', 'st_agelim'],\n", " dtype='object')" ] }, "execution_count": 7, "metadata": {}, "output_type": "execute_result" } ], "source": [ "planets.columns\n", "# names of columns" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "# skip\n", "#planets.loc[0:10][\"pl_orbeccen\"] # grab 1-10 entries, and print out the eccentricites of those entries\n", "# notice there are some NaN's -> these just don't have entries" ] }, { "cell_type": "code", "execution_count": 9, 
"metadata": {}, "outputs": [ { "data": { "text/plain": [ "array(['11 Com', '11 UMi', '14 And', ..., 'tau Gem', 'ups And', 'xi Aql'],\n", " dtype=object)" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# what are the names of the unique host stars?\n", "planets[\"pl_hostname\"].unique() " ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "2994" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "planets[\"pl_hostname\"].nunique() # how many unique host stars?" ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
\n", "

8 rows × 79 columns

\n", "
" ], "text/plain": [ " pl_controvflag pl_pnum pl_orbper pl_orbpererr1 \\\n", "count 4016.000000 4016.000000 3.907000e+03 3.775000e+03 \n", "mean 0.002739 1.772410 2.326363e+03 1.051761e+03 \n", "std 0.052271 1.159506 1.171632e+05 5.948289e+04 \n", "min 0.000000 1.000000 9.070629e-02 0.000000e+00 \n", "25% 0.000000 1.000000 4.516936e+00 1.600000e-05 \n", "50% 0.000000 1.000000 1.193212e+01 9.500000e-05 \n", "75% 0.000000 2.000000 4.231980e+01 1.173500e-03 \n", "max 1.000000 8.000000 7.300000e+06 3.650000e+06 \n", "\n", " pl_orbpererr2 pl_orbperlim pl_orbsmax pl_orbsmaxerr1 \\\n", "count 3.775000e+03 3941.000000 2338.000000 1535.000000 \n", "mean -1.082432e+03 -0.000507 6.702867 0.424368 \n", "std 5.968316e+04 0.039020 81.279863 5.715557 \n", "min -3.650000e+06 -1.000000 0.004400 0.000000 \n", "25% -1.165000e-03 0.000000 0.057000 0.000800 \n", "50% -9.600000e-05 0.000000 0.118840 0.003100 \n", "75% -1.600000e-05 0.000000 0.680000 0.040000 \n", "max 0.000000e+00 1.000000 2500.000000 200.000000 \n", "\n", " pl_orbsmaxerr2 pl_orbsmaxlim ... st_sperr st_splim st_lum \\\n", "count 1534.000000 2501.000000 ... 0.0 1335.0 584.000000 \n", "mean -0.429475 -0.000400 ... NaN 0.0 -0.136529 \n", "std 5.996049 0.019996 ... NaN 0.0 1.135486 \n", "min -200.000000 -1.000000 ... NaN 0.0 -3.480000 \n", "25% -0.040000 0.000000 ... NaN 0.0 -0.680000 \n", "50% -0.003100 0.000000 ... NaN 0.0 -0.024500 \n", "75% -0.000808 0.000000 ... NaN 0.0 0.479000 \n", "max 0.000000 0.000000 ... 
NaN 0.0 3.015000 \n", "\n", " st_lumerr1 st_lumerr2 st_lumlim st_age st_ageerr1 \\\n", "count 506.000000 493.000000 1401.0 2004.000000 1865.000000 \n", "mean 0.067524 -0.075288 0.0 4.161321 2.991969 \n", "std 0.090381 0.128068 0.0 2.162373 2.117259 \n", "min 0.000000 -1.431000 0.0 0.001000 0.001000 \n", "25% 0.027000 -0.077000 0.0 2.950000 1.140000 \n", "50% 0.049000 -0.050000 0.0 4.070000 2.500000 \n", "75% 0.079000 -0.028000 0.0 4.790000 4.890000 \n", "max 0.845000 0.000000 0.0 23.000000 26.000000 \n", "\n", " st_ageerr2 st_agelim \n", "count 1865.000000 904.0 \n", "mean -1.947577 0.0 \n", "std 1.087434 0.0 \n", "min -8.000000 0.0 \n", "25% -2.710000 0.0 \n", "50% -2.020000 0.0 \n", "75% -1.030000 0.0 \n", "max -0.001000 0.0 \n", "\n", "[8 rows x 79 columns]" ] }, "execution_count": 11, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# if you are used to R at all, this is sort of like \"summary\" function, but basically giving some \n", "# summary statistics for the numerical data in our dataset\n", "planets.describe()\n", "# note that while things like the statistics for the orbital period are interesting\n", "# the \"mean\" of the pl_controvflag which is a flag if this planet is controversal or not\n", "# is essentially meaningless" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [], "source": [ "# skip\n", "# we can also search for subsets easily\n", "# we can look for only circular orbits\n", "# -> look for eccentricity == 0\n", "#planets.loc[planets[\"pl_orbeccen\"] == 0.0] " ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [], "source": [ "# skip\n", "# or very eccentric orbits\n", "#planets.loc[planets[\"pl_orbeccen\"] >= 0.9] " ] }, { "cell_type": "code", "execution_count": 9, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "(0.0, 0.95)" ] }, "execution_count": 9, "metadata": {}, "output_type": "execute_result" } ], "source": [ "# we can take min & maxes like with numpy 
arrays:\n", "# min and max of eccentricity\n", "planets['pl_orbeccen'].min(), planets['pl_orbeccen'].max()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "pandas also has features such as \"groupby\" that we aren't likely to get into right now, but that we'll come across naturally later." ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# pandas also provides a matplotlib-like interface\n", "# to make quick plots to look at our data\n", "planets[\"pl_orbeccen\"].plot() # easy plots with pandas dataframes" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Note this ils like doing a `matplotlib` style plot, but now our plot is associated with our data.\n", "\n", "So we can see there are a lot of both zero and less very eccentric planets. \n", "\n", "Note also that there are a lot of empty spots - this indicates where there are \"NaN\"s - or non-entries.\n", "\n", "So, maybe this isn't what we want to know - we really want to know about the eccentricity *distribution* - so let's plot that:" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "" ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "iVBORw0KGgoAAAANSUhEUgAAAXoAAAD8CAYAAAB5Pm/hAAAABHNCSVQICAgIfAhkiAAAAAlwSFlzAAALEgAACxIB0t1+/AAAADl0RVh0U29mdHdhcmUAbWF0cGxvdGxpYiB2ZXJzaW9uIDMuMC4zLCBodHRwOi8vbWF0cGxvdGxpYi5vcmcvnQurowAAEcNJREFUeJzt3WuMXddZxvH/Q0zaENomTZqRZRtchIFGRG2jUQmqBNMaoSZBdT40qFWgTmVhqYQCqgUY+MD1QwoKoY1QwSKAW6WXECi22nCJ3BxxEQkkJI2bhCommGSwqUubGKZRAcPLh9kOQzzNbM+ci2ed/08anb3XWefs9e6ZPLOy5uztVBWSpHZ9zaQHIEkaLYNekhpn0EtS4wx6SWqcQS9JjTPoJalxBr0kNc6gl6TGGfSS1LgNkx4AwKWXXlpbt25d1Wu//OUvc+GFFw53QOuI9U93/eA5mOb6H3zwwX+tqlet1O+cCPqtW7fywAMPrOq1g8GAubm54Q5oHbH+6a4fPAfTXH+Sf+rTz6UbSWqcQS9JjTPoJalxBr0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklq3DlxZexaHP7nk9y491MTOfbRm6+dyHEl6Ww4o5ekxhn0ktQ4g16SGmfQS1LjDHpJapxBL0mNM+glqXEGvSQ1zqCXpMYZ9JLUuF5Bn+SiJHcl+fskjyf5ziSvTHJPkie6x4u7vknygSRHkjyS5MrRliBJejF9Z/TvB/6kqr4NeC3wOLAXOFRV24BD3T7A1cC27ms38MGhjliSdFZWDPokLwe+C7gdoKr+s6qeBXYA+7tu+4Hruu0dwIdq0X3ARUk2Dn3kkqRe+szovwn4AvC7SR5K8ttJLgRmquo4QPd4Wdd/E/D0ktfPd22SpAnoc5viDcCVwHuq6v4k7+f/lmmWk2Xa6oxOyW4Wl3aYmZlhMBj0GMqZZi6APVecWtVr12q1Yx6mhYWFc2IckzLt9YPnY
Nrr76NP0M8D81V1f7d/F4tB//kkG6vqeLc0c2JJ/y1LXr8ZOPbCN62qfcA+gNnZ2Zqbm1tVAbfdcYBbDk/mtvpHb5ibyHGXGgwGrPbctWDa6wfPwbTX38eKSzdV9S/A00m+tWvaDjwGHAR2dm07gQPd9kHgnd2nb64CTp5e4pEkjV/fqfB7gDuSnA88CbyLxV8SdybZBTwFXN/1vRu4BjgCPNf1lSRNSK+gr6qHgdllntq+TN8CblrjuCRJQ+KVsZLUOINekhpn0EtS4wx6SWqcQS9JjTPoJalxBr0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklqnEEvSY0z6CWpcQa9JDXOoJekxhn0ktQ4g16SGmfQS1LjDHpJapxBL0mNM+glqXEGvSQ1rlfQJzma5HCSh5M80LW9Msk9SZ7oHi/u2pPkA0mOJHkkyZWjLECS9OLOZkb/pqp6XVXNdvt7gUNVtQ041O0DXA1s6752Ax8c1mAlSWdvLUs3O4D93fZ+4Lol7R+qRfcBFyXZuIbjSJLWoG/QF/BnSR5Msrtrm6mq4wDd42Vd+ybg6SWvne/aJEkTsKFnvzdW1bEklwH3JPn7F+mbZdrqjE6LvzB2A8zMzDAYDHoO5f+buQD2XHFqVa9dq9WOeZgWFhbOiXFMyrTXD56Daa+/j15BX1XHuscTST4BvAH4fJKNVXW8W5o50XWfB7Yseflm4Ngy77kP2AcwOztbc3NzqyrgtjsOcMvhvr+vhuvoDXMTOe5Sg8GA1Z67Fkx7/eA5mPb6+1hx6SbJhUlednob+F7gs8BBYGfXbSdwoNs+CLyz+/TNVcDJ00s8kqTx6zMVngE+keR0/49U1Z8k+VvgziS7gKeA67v+dwPXAEeA54B3DX3UkqTeVgz6qnoSeO0y7V8Eti/TXsBNQxmdJGnNvDJWkhpn0EtS4wx6SWqcQS9JjTPoJalxBr0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklqnEEvSY0z6CWpcQa9JDXOoJekxhn0ktQ4g16SGmfQS1LjDHpJapxBL0mNM+glqXEGvSQ1zqCXpMb1Dvok5yV5KMknu/1XJ7k/yRNJPp7k/K79Jd3+ke75raMZuiSpj7OZ0f8Y8PiS/fcBt1bVNuAZYFfXvgt4pqq+Gbi16ydJmpBeQZ9kM3At8NvdfoA3A3d1XfYD13XbO7p9uue3d/0lSRPQd0b/68BPAv/T7V8CPFtVp7r9eWBTt70JeBqge/5k11+SNAEbVuqQ5PuAE1X1YJK5083LdK0ezy19393AboCZmRkGg0Gf8Z5h5gLYc8WplTuOwGrHPEwLCwvnxDgmZdrrB8/BtNffx4pBD7wReGuSa4CXAi9ncYZ/UZIN3ax9M3Cs6z8PbAHmk2wAXgF86YVvWlX7gH0As7OzNTc3t6oCbrvjALcc7lPG8B29YW4ix11qMBiw2nPXgmmvHzwH015/Hysu3VTVT1fV5qraCrwd+HRV3QDcC7yt67YTONBtH+z26Z7/dFWdMaOXJI3HWj5H/1PAe5McYXEN/vau/Xbgkq79vcDetQ1RkrQWZ7XmUVUDYNBtPwm8YZk+XwGuH8LYJElD4JWxktQ4g16SGmfQS1LjDHpJapxBL0mNM+glqXEGvSQ1zqCXpMYZ9JLUOINekhpn0EtS4wx6SWqcQS9JjTPoJalxBr0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklqnEEvSY0z6CWpcQa9JDVuxaBP8tIkf5PkM0keTfILXfurk9yf5IkkH09yftf+km7/SPf81tGWIEl6MX1m9P8BvLmqXgu8DnhLkquA9wG3VtU24BlgV9d/F/BMVX0zcGvXT5I0ISsGfS1a6Ha/tvsq4M3AXV37fuC6bntHt0/3/PYkGdqIJUlnpdcafZLzkjwMnADuAf4BeLaqTnVd5oFN3fYm4GmA7vmTwCXDHLQkqb8NfTpV1X8Dr0tyEfAJ4DXLdesel5u91wsbkuwGdgPMz
MwwGAz6DOUMMxfAnitOrdxxBFY75mFaWFg4J8YxKdNeP3gOpr3+PnoF/WlV9WySAXAVcFGSDd2sfTNwrOs2D2wB5pNsAF4BfGmZ99oH7AOYnZ2tubm5VRVw2x0HuOXwWZUxNEdvmJvIcZcaDAas9ty1YNrrB8/BtNffR59P3byqm8mT5ALge4DHgXuBt3XddgIHuu2D3T7d85+uqjNm9JKk8egzFd4I7E9yHou/GO6sqk8meQz4WJJfBh4Cbu/63w58OMkRFmfybx/BuCVJPa0Y9FX1CPD6ZdqfBN6wTPtXgOuHMjpJ0pp5ZawkNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklq3GQuKW3E1r2fmshxj9587USOK2l9ckYvSY0z6CWpcQa9JDXOoJekxhn0ktQ4g16SGmfQS1LjDHpJapxBL0mNM+glqXEGvSQ1zqCXpMYZ9JLUOINekhpn0EtS4wx6SWrcikGfZEuSe5M8nuTRJD/Wtb8yyT1JnugeL+7ak+QDSY4keSTJlaMuQpL01fWZ0Z8C9lTVa4CrgJuSXA7sBQ5V1TbgULcPcDWwrfvaDXxw6KOWJPW2YtBX1fGq+rtu+9+Bx4FNwA5gf9dtP3Bdt70D+FAtug+4KMnGoY9cktTLWa3RJ9kKvB64H5ipquOw+MsAuKzrtgl4esnL5rs2SdIE9P7HwZN8PfAHwI9X1b8l+apdl2mrZd5vN4tLO8zMzDAYDPoO5f+ZuQD2XHFqVa9dr5aeq4WFhVWfuxZMe/3gOZj2+vvoFfRJvpbFkL+jqv6wa/58ko1VdbxbmjnRtc8DW5a8fDNw7IXvWVX7gH0As7OzNTc3t6oCbrvjALcc7v37qglHb5h7fnswGLDac9eCaa8fPAfTXn8ffT51E+B24PGq+rUlTx0EdnbbO4EDS9rf2X365irg5OklHknS+PWZCr8R+EHgcJKHu7afAW4G7kyyC3gKuL577m7gGuAI8BzwrqGOWJJ0VlYM+qr6S5ZfdwfYvkz/Am5a47gkSUPilbGS1DiDXpIaZ9BLUuMMeklqnEEvSY0z6CWpcdN1SWkjtu791PPbe644xY1L9kft6M3Xju1YkobDGb0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklqnEEvSY0z6CWpcQa9JDXOoJekxhn0ktQ4g16SGmfQS1LjDHpJapxBL0mNM+glqXErBn2S30lyIslnl7S9Msk9SZ7oHi/u2pPkA0mOJHkkyZWjHLwkaWV9ZvS/B7zlBW17gUNVtQ041O0DXA1s6752Ax8czjAlSau1YtBX1Z8DX3pB8w5gf7e9H7huSfuHatF9wEVJNg5rsJKks7faNfqZqjoO0D1e1rVvAp5e0m++a5MkTciw/3HwLNNWy3ZMdrO4vMPMzAyDwWBVB5y5YPEfyJ5W465/td+nUVlYWDjnxjRu034Opr3+PlYb9J9PsrGqjndLMye69nlgy5J+m4Fjy71BVe0D9gHMzs7W3NzcqgZy2x0HuOXwsH9frR97rjg11vqP3jA3tmP1MRgMWO3PTium/RxMe/19rDYhDgI7gZu7xwNL2n8kyceA7wBOnl7iURu27v3URI579OZrJ3JcqQUrBn2SjwJzwKVJ5oGfYzHg70yyC3gKuL7rfjdwDXAEeA541wjGLEk6CysGfVW946s8tX2ZvgXctNZBSZKGxytjJalxBr0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklqnEEvSY2b3pvEaF35arde2HPFKW4c4W0ZvPWCWuCMXpIaZ9BLUuMMeklqnGv00ouY1G2Zwb8PaHic0UtS4wx6SWqcQS9JjTPoJalxBr0kNc6gl6TG+fFK6RzV96Odo74NxDj5kdLRcEYvSY0z6CWpcS7dSDpnrOZK5GEsXbW+ZDSSGX2StyT5XJIjSfaO4hiSpH6GHvRJzgN+A7gauBx4R5LLh30cSVI/o1i6eQNwpKqeBEjyMWAH8NgIjiVJa9b6zetGsXSzC
Xh6yf581yZJmoBRzOizTFud0SnZDezudheSfG6Vx7sU+NdVvnbd+1Hrn+r6wXOw3uvP+9b08m/s02kUQT8PbFmyvxk49sJOVbUP2LfWgyV5oKpm1/o+65X1T3f94DmY9vr7GMXSzd8C25K8Osn5wNuBgyM4jiSph6HP6KvqVJIfAf4UOA/4nap6dNjHkST1M5ILpqrqbuDuUbz3Mta8/LPOWb+m/RxMe/0rStUZfyeVJDXEe91IUuPWTdCvdFuFJC9J8vHu+fuTbB3/KEenR/3vTfJYkkeSHErS62NX60Xf22okeVuSStLUpzD61J/k+7ufgUeTfGTcYxylHj//35Dk3iQPdf8NXDOJcZ6zquqc/2Lxj7r/AHwTcD7wGeDyF/T5YeA3u+23Ax+f9LjHXP+bgK/rtt89bfV3/V4G/DlwHzA76XGP+fu/DXgIuLjbv2zS4x5z/fuAd3fblwNHJz3uc+lrvczon7+tQlX9J3D6tgpL7QD2d9t3AduTLHfx1nq0Yv1VdW9VPdft3sfi9Qut6PP9B/gl4FeAr4xzcGPQp/4fAn6jqp4BqKoTYx7jKPWpv4CXd9uvYJlrd6bZegn6PrdVeL5PVZ0CTgKXjGV0o3e2t5XYBfzxSEc0XivWn+T1wJaq+uQ4BzYmfb7/3wJ8S5K/SnJfkreMbXSj16f+nwd+IMk8i5/4e894hrY+rJf70fe5rUKvWy+sU71rS/IDwCzw3SMd0Xi9aP1Jvga4FbhxXAMasz7f/w0sLt/Msfh/c3+R5Nur6tkRj20c+tT/DuD3quqWJN8JfLir/39GP7xz33qZ0fe5rcLzfZJsYPF/3740ltGNXq/bSiT5HuBngbdW1X+MaWzjsFL9LwO+HRgkOQpcBRxs6A+yfX/+D1TVf1XVPwKfYzH4W9Cn/l3AnQBV9dfAS1m8B45YP0Hf57YKB4Gd3fbbgE9X95eZBqxYf7d08VsshnxL67OwQv1VdbKqLq2qrVW1lcW/Uby1qh6YzHCHrs/P/x+x+Ad5klzK4lLOk2Md5ej0qf8pYDtAktewGPRfGOsoz2HrIui7NffTt1V4HLizqh5N8otJ3tp1ux24JMkR4L1AM/+yVc/6fxX4euD3kzycpJn7C/Wsv1k96/9T4ItJHgPuBX6iqr44mREPV8/69wA/lOQzwEeBGxua6K2ZV8ZKUuPWxYxekrR6Br0kNc6gl6TGGfSS1DiDXpIaZ9BLUuMMeklqnEEvSY37X7qoNVO6dLg1AAAAAElFTkSuQmCC\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "planets[\"pl_orbeccen\"].hist()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "We can add labels and things like this like we'd do with `matplotlib` type plots, but the way we do it is a little different:" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "Text(0, 0.5, 'Number of observed planets with this eccentricity')" ] }, "execution_count": 17, "metadata": {}, "output_type": "execute_result" }, { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "myPlot = planets[\"pl_orbeccen\"].hist()\n", "myPlot.set_xlabel('Eccentricity')\n", "myPlot.set_ylabel('Number of observed planets with this eccentricity')" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Now we are getting somewhere - it seems like there are many near-circular orbits in the Kepler dataset!\n", "\n", "We can make the same sort of plot using `matplotlib` as well:" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": { "needs_background": "light" }, "output_type": "display_data" } ], "source": [ "# first, create figure and axis objects\n", "fig, ax = plt.subplots(1, 1, figsize = (10, 10))\n", "\n", "# set this histogram to be on this ax object\n", "planets[\"pl_orbeccen\"].hist(ax=ax)\n", "\n", "# add labels with ax:\n", "ax.set_xlabel('Eccentricity')\n", "ax.set_ylabel('Number of Kepler Planets with an eccentricity')\n", "\n", "plt.show()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Plot Styles\n", "Now, let's play with something called the \"style\" of the plot. If you've used R, or come across it later in life, you may have used the \"ggplot\" package. We can make plots in this style with Python too:" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "image/png": "\n", "text/plain": [ "
" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "with plt.style.context(\"ggplot\"):\n", "    # first, create figure and axis objects\n", "    fig, ax = plt.subplots(1, 1, figsize = (10, 10))\n", "\n", "    # set this histogram to be on this ax object\n", "    planets[\"pl_orbeccen\"].hist(ax=ax)\n", "\n", "    # add labels with ax:\n", "    ax.set_xlabel('Eccentricity')\n", "    ax.set_ylabel('Number of Kepler Planets with an eccentricity')\n", "\n", "    plt.show() " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "What plot styles are available to us?" ] }, { "cell_type": "code", "execution_count": 20, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['seaborn-dark',\n", " 'seaborn-darkgrid',\n", " 'seaborn-ticks',\n", " 'fivethirtyeight',\n", " 'seaborn-whitegrid',\n", " 'classic',\n", " '_classic_test',\n", " 'fast',\n", " 'seaborn-talk',\n", " 'seaborn-dark-palette',\n", " 'seaborn-bright',\n", " 'seaborn-pastel',\n", " 'grayscale',\n", " 'seaborn-notebook',\n", " 'ggplot',\n", " 'seaborn-colorblind',\n", " 'seaborn-muted',\n", " 'seaborn',\n", " 'Solarize_Light2',\n", " 'seaborn-paper',\n", " 'bmh',\n", " 'tableau-colorblind10',\n", " 'seaborn-white',\n", " 'dark_background',\n", " 'seaborn-poster',\n", " 'seaborn-deep']" ] }, "execution_count": 20, "metadata": {}, "output_type": "execute_result" } ], "source": [ "plt.style.available" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "But what if we want to see how our plot would look with each of these styles? 
We could just make a bunch of plots OR we can play with this interactively with ipywidgets:" ] }, { "cell_type": "code", "execution_count": 21, "metadata": {}, "outputs": [], "source": [ "import ipywidgets" ] }, { "cell_type": "code", "execution_count": 23, "metadata": {}, "outputs": [ { "data": { "application/vnd.jupyter.widget-view+json": { "model_id": "08dfc9f3c6e74d70a1d07e392393165c", "version_major": 2, "version_minor": 0 }, "text/plain": [ "interactive(children=(Dropdown(description='style', options=('seaborn-dark', 'seaborn-darkgrid', 'seaborn-tick…" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "# let's tell jupyter ipywidgets that we want to \n", "# mess around with the style of the plot\n", "@ipywidgets.interact(style = plt.style.available)\n", "def make_plot(style):\n", "    with plt.style.context(style):\n", "        # first, create figure and axis objects\n", "        fig, ax = plt.subplots(1, 1, figsize = (10, 10))\n", "\n", "        # set this histogram to be on this ax object\n", "        planets[\"pl_orbeccen\"].hist(ax=ax)\n", "\n", "        # add labels with ax:\n", "        ax.set_xlabel('Eccentricity')\n", "        ax.set_ylabel('Number of Kepler Planets with an eccentricity')\n", "\n", "        plt.show() \n", "    \n", "# so now you can see that we get a little dropdown menu that lists\n", "# all the different styles!\n", "# **play with this a bit!!**" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Ok, but what did we just do? We made something in Jupyter interactive. We'll have a lot of opportunities to mess around with widgets in the next class. 
If you want, you can read more in the docs: https://ipywidgets.readthedocs.io/en/stable/examples/Using%20Interact.html\n", "\n", "Basically, what ipywidgets.interact does is look at a function's inputs and make a little interactive widget for each of them. So, just like we did with the \"make_plot\" function above, we can do this for other things, like changing the value in a print statement:\n", "```\n", "@ipywidgets.interact(x=10)\n", "def f(x):\n", "    print(\"my value = \" + str(x))\n", "```\n", "\n", "Note this is a little different from the format in the docs, and you can use whichever you'd like. The \"@\" symbol marks a \"decorator\": here it hands the function defined just below it to `ipywidgets.interact`, which builds the widgets and calls the function again whenever a widget value changes.\n", "\n", "At any rate, the takeaway is that you can call it like this, or how they do it in the docs; it's up to you!" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Another example: changing the histogram binning\n", "We can actually run multiple widgets to do many interactive things at the same time, for example, plot style and histogram binning:" ] }, { "cell_type": "code", "execution_count": 24, "metadata": {}, "outputs": [ { "data": { "application/vnd.jupyter.widget-view+json": { "model_id": "b421627e219a4ba8b38a42e565124fdd", "version_major": 2, "version_minor": 0 }, "text/plain": [ "interactive(children=(Dropdown(description='style', options=('seaborn-dark', 'seaborn-darkgrid', 'seaborn-tick…" ] }, "metadata": {}, "output_type": "display_data" } ], "source": [ "@ipywidgets.interact(style = plt.style.available, number_of_bins = range(1,20,1))\n", "def make_plot(style, number_of_bins):\n", "    with plt.style.context(style):\n", "        # first, create figure and axis objects\n", "        fig, ax = plt.subplots(1, 1, figsize = (10, 10))\n", "\n", "        # set this histogram to be on this ax object\n", "        planets[\"pl_orbeccen\"].hist(ax=ax, bins=number_of_bins)\n", "\n", "        # add labels with ax:\n", "        ax.set_xlabel('Eccentricity')\n", "        ax.set_ylabel('Number of 
Kepler Planets with an eccentricity')\n", "\n", "        plt.show() \n", "    " ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "### Exercise\n", "Pick another variable besides eccentricity and repeat this exercise. Check out the header of the data file for a list of what the other parameters are.\n", "\n", "Bonus: Pick another plotting variable besides style (like color of bars) to change. You might want to check out the parameters available for the pandas histogram function: https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.hist.html and for general matplotlib histograms: https://matplotlib.org/3.1.1/api/_as_gen/matplotlib.pyplot.hist.html\n", "\n", "Bonus: make the variable another ipywidget input\n", "\n", "Bonus: instead of a histogram, plot one variable vs. another\n", "\n", "Bonus: make a 2 panel plot with the ability to change different things on different plots" ] }, { "cell_type": "code", "execution_count": null, "metadata": {}, "outputs": [], "source": [] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.3" } }, "nbformat": 4, "nbformat_minor": 2 }