Research Computing group expands Advanced Computing support

On March 17th, Dr. Juan Carlos Zuñiga-Anaya joined us in Research Computing in Client Services as our Advanced Computing Research Analyst.

photo

Juan has been working in academic environments for several years in the fields of scientific computing, high performance computing, and scientific software development, including time as an assistant professor in Mathematics/Computer Science at the University of Guadalajara. He has a broad education, including a MSc in Electrical Engineering, specializing in applied Computer Science, and a PhD in Systems Theory, with a specialization in computational mathematics.

Juan will be working in our group focusing on advanced computing/HPC and scientific software. If you need help with optimizing your code, working with scientific software such as MATLAB,  using HPC research infrastructure, or just want to chat about your research, please contact Juan and the rest of the team at research_computing@usask.ca.

Big Data Hubris

Scientific American recently published an interesting article on their Observation Blog, titled Why Big Data Isn’t Necessarily Better Data. It nicely highlights one of the pitfalls of implementing Big Data analytics, namely, believing your data and/or data analysis are better than they are.   The example used in the article is Google Flu Trends (GFT) which seek to use Google’s search data to track outbreaks of influenza.  The results have been mixed, with results being similar to the CDC tracking data but overestimating the prevalence of flu in 100 of 108 weeks in 2011-2013 which one study examined. Google itself has found that the GFT data is highly susceptible to media coverage, which may explain some of these issues.

The main takeaway from this article is to be conscious of factors that could affect the quality of both your data and your analysis. This quote sums it up nicely:

Big data hubris is the “often implicit assumption that big data are a substitute for, rather than a supplement to, traditional data collection and analysis.”

Big data is a useful tool, but like all tools it must be used properly.  Some food for thought.

Sharing Files from Silo with Globus Connect Server

I’ve previously mentioned Globus, the service that allows you to transfer files to/from/between WestGrid sites from a handy web application. That article can be found here.

Compute Canada recently started rolling out a new service from Globus, called Globus Connect Server, that further enhances this application. The main benefit from the new Globus Connect Server is file sharing.  With this, you will be able to share files from any WestGrid site and make them available to anyone, anywhere, as long as they have a free Globus account.  It is currently only configured to share from Silo, however.

Continue reading

SuperComputing Conference Report

In November, I attended the Supercomputing conference in Denver, CO. The theme of this year’s conference was ‘HPC Everywhere’. Hundreds of presentations were given on all aspects of high-performance computing, from traditional HPC to big data and fast networking. All the major players in the HPC space, and many of the smaller ones had exhibits making for a lot to see in just a few days.

Continue reading

Using Globus

Data volume is growing by leaps and bound; more and more data is being generated every day that needs to be computed, analyzed, and stored. While traditional data transfer tools like FTP, scp, sftp and rsync work well for local or small transfers, transferring big data long distances is still slow and prone to interruption.

Fortunately, it’s now easier than ever to get your data onto WestGrid by using Globus.  This service is provided by the University of Chicago, and allows quick and easy file transfers to and from any Globus-enabled system.  By using the free Globus Connect program, this includes your desktop.

Continue reading