Friday, July 26, 2013

Field Directed Work Week 5

This was a successful week for getting the “SS” dataset into IBM’s Analyst Notebook. I first imported a shorter practice dataset to test whether the data could be imported at all. Based on the fields and their definitions available in the dataset, I created an attribute relationship. Once the practice set went in cleanly, all 50,000 records were imported. A few glitches occurred, so we looked at the raw data in Excel and determined that null data was causing column misalignment. After adjusting those rows in Excel, we were able to lay out the entire set in a peacock view in Analyst Notebook. Here is a screenshot of what peacock data looks like.

SS Peacock
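For anyone who would rather script the misalignment check than eyeball it in Excel, here is a minimal sketch of the idea in Python. It is not what we actually did (we fixed the rows by hand), and the function name and sample rows are made up for illustration. It simply flags any row whose number of fields differs from the header's.

```python
import csv
import io


def find_misaligned_rows(csv_text):
    """Return (line_number, field_count) for rows whose field count
    differs from the header row's field count."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)
    expected = len(header)
    problems = []
    for row in reader:
        if len(row) != expected:
            # reader.line_num is the line of the source text just read
            problems.append((reader.line_num, len(row)))
    return problems


# Hypothetical sample: "Bob" is missing a value AND its delimiter, so
# the columns shift. "Cy" has an empty value but keeps its delimiter,
# so it stays aligned.
sample = "name,city,year\nAnn,Richland,2013\nBob,2012\nCy,,2011\n"
print(find_misaligned_rows(sample))
```

Rows with an empty value but the right number of delimiters pass this check, so it only catches the kind of null that actually shifts the columns.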


The relationships in the data stem from what the user wants to correlate.  For example, I created a fake dataset of name, birth date and gender in Excel.

Association Chart

I chose an association chart, but the user has many options for how the data should relate.


Import Specification

I then determined how I wanted to correlate the data. In this case, I wanted the name associated with the birth date, and the birth date associated with the gender.


Male Female

Once the data is imported, an individual male chart and a female chart are created, showing the relationships of name to date of birth.
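The same split can be pictured outside the tool. Below is a rough Python sketch, with invented names and dates standing in for my fake Excel dataset, that groups records by gender and keeps the name-to-birth-date pairs for each group. Analyst Notebook does this through its import specification, so this only illustrates the idea, not how the tool works internally.

```python
from collections import defaultdict

# Invented records for illustration only
records = [
    {"name": "Alice Example", "birth_date": "1980-01-15", "gender": "Female"},
    {"name": "Bob Sample", "birth_date": "1975-06-30", "gender": "Male"},
    {"name": "Carol Test", "birth_date": "1990-11-02", "gender": "Female"},
]


def group_by_gender(rows):
    """Return {gender: [(name, birth_date), ...]}, one list per chart."""
    charts = defaultdict(list)
    for row in rows:
        charts[row["gender"]].append((row["name"], row["birth_date"]))
    return dict(charts)


charts = group_by_gender(records)
for gender, links in charts.items():
    print(gender, links)
```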


At the end of the week, after returning to Richland for a few days, I met with a co-worker to discuss next steps in massaging some Twitter data for import into the tools and how to deal with columns that contain null values.

Sunday, July 21, 2013

Happy 5th Anniversary, Charlottesville and the Grand Caverns

2013 marks our fifth anniversary, and we’re spending it over here in Virginia with Lucy.  We decided to get out of town and go to Charlottesville.


Here are this year’s Glassy Babies to add to our collection.  The Creme Brulee colors represent our trip to Utah last year with all the reds, oranges, and rust colors of the landscape.  The Canary color is all the happiness we experienced last year with all of Lucy’s milestones and the celebration of another fun year together.  The Hudson colors represent the sunsets we experienced in Kauai last year.  The colors include blues, purples and hues of rust.  Last year was a great year!


Charlottesville is a super cute town, home to the University of Virginia, Monticello and lots of Virginia history.  We stayed at the Boar's Head Inn for two nights and it was phenomenal.






Our fancy breakfast, our first time having REAL maple syrup.  Yummy!


Monticello is Thomas Jefferson’s home.  It is an engineering marvel.  He was such a forward thinker for his time, from his agricultural research, to his weather predicting, to his water collection methods and his clock and calendar contraption.  He also had a HUGE library which was really neat to see.  Monticello is on the back of the nickel.


IMG_0867 IMG_0868

IMG_0870 IMG_0873





IMG_0875 IMG_0883


Downtown Charlottesville

IMG_0889 IMG_0887



Grand Caverns

We went to the Grand Caverns on our way to the Shenandoah Mountains.  It was really neat to see. Lucy loved it too.

IMG_0894 IMG_0910







Friday, July 19, 2013

Field Directed Work Week 4

For the visualization tools I’m learning about this summer, my mentors want to view a variety of datasets in the same tools.  But each tool expects the data in a different format, and not all datasets come from the same source types.  This can be challenging because each source limits how its data can be exported and each tool limits what it can ingest.

One method of obtaining data is via RSS feeds, which can be exported in XML format.  Searches in library databases return historical data, while RSS feeds from the news, Twitter, or blogs are live sources of information.   I spent quite a bit of time trying to set up my feed from Open Source Center and loading it into various RSS feed readers, without any success.  My gut feeling is that the information is probably not publicly available, that something is wrong on the Open Source Center’s end, or that you have to authenticate to view the documents.  When or if the RSS feeds work, we will need templates for the XML conversions.
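To give a sense of what an RSS export looks like once a feed does work, here is a small Python sketch that pulls the items out of an RSS 2.0 document with the standard library. The feed text is a made-up example, not the Open Source Center feed (which I never got working), and the choice of fields to keep is only an assumption about what a conversion template might need.

```python
import xml.etree.ElementTree as ET

# Made-up RSS 2.0 feed for illustration only
SAMPLE_RSS = """<rss version="2.0">
  <channel>
    <title>Example feed</title>
    <item>
      <title>First story</title>
      <link>http://example.com/1</link>
      <pubDate>Fri, 19 Jul 2013 12:00:00 GMT</pubDate>
    </item>
    <item>
      <title>Second story</title>
      <link>http://example.com/2</link>
    </item>
  </channel>
</rss>"""


def parse_items(rss_text):
    """Return one dict per <item>; missing elements become empty strings."""
    root = ET.fromstring(rss_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default=""),
            "link": item.findtext("link", default=""),
            "pubDate": item.findtext("pubDate", default=""),
        })
    return items


items = parse_items(SAMPLE_RSS)
for entry in items:
    print(entry)
```

Filling missing elements with an empty string keeps every item the same shape, which matters later when the items become rows in a table.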

This week I installed Tableau Software on my computer, watched a lot of the tutorials, ingested some data and played around with the tool.   Tableau Software’s mission is “Help people see and understand data.”  The tool produces visually stunning graphics and is very much a business- and numbers-oriented tool.  Here are some examples of what Tableau Software can output:




Last quarter in LIS588 I actually profiled Tableau Software and gave a presentation on it.  Here is the link to the Prezi presentation.   I made datasets that included text, budget numbers, varying keyword columns and dates to see how the ingestion process works and how the tool handles the information.   It is definitely focused on business analytics rather than text analysis, which sets it apart from IN-SPIRE and Tool “B”.

This week I also transferred dataset “SS” into Tool “B” and into IN-SPIRE and compared how the two handled it.  I had a discussion with a co-worker on converting XML files to CSV to import dataset “SS” into Analyst Notebook and Tableau Software.  We also discussed converting the Web of Science and IEEE Xplore datasets to a more usable format in Excel (CSV).
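As a rough picture of what an XML-to-CSV conversion could look like, here is a short Python sketch. The element names (`records`, `record`, `title`, `year`, `source`) and the two sample entries are hypothetical. The real exports from Web of Science, IEEE Xplore and the “SS” dataset have their own schemas, so the field list would need to be adapted for each one.

```python
import csv
import io
import xml.etree.ElementTree as ET

# Hypothetical XML layout for illustration only
SAMPLE_XML = """<records>
  <record><title>Paper A</title><year>2012</year><source>IEEE Xplore</source></record>
  <record><title>Paper B</title><source>Web of Science</source></record>
</records>"""

FIELDS = ["title", "year", "source"]


def xml_to_csv(xml_text, fields=FIELDS):
    """Write one CSV row per <record>, one column per field name.
    A missing element becomes an empty cell so the columns stay aligned."""
    root = ET.fromstring(xml_text)
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    for record in root.iter("record"):
        writer.writerow({f: record.findtext(f, default="") for f in fields})
    return out.getvalue()


csv_text = xml_to_csv(SAMPLE_XML)
print(csv_text)
```

Writing every row against a fixed list of fields is the design choice that matters here: a record with no year still gets an empty year column instead of shifting its remaining values to the left.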

Sunday, July 14, 2013

U.S. Botanic Garden and Natural History Museum

We decided to go visit the U.S. Botanic Garden because a co-worker told us that the corpse flower would be in bloom.  It only blooms for 24-48 hours and smells of putrid, rotting flesh.  It is HUGE.  Unfortunately, when we got there it was not blooming, but we did get to see a lot of the other beautiful plants.



IMG_0765 IMG_0785



 IMG_0778 IMG_0776

IMG_0786 IMG_0781

 IMG_0791 IMG_0790




Afterwards we stopped at the Natural History Museum to see the dinosaurs, fossils and other exciting things.