ePubs are best read on MacOS with Books, Windows with Microsoft Edge, or Calibre, and Linux with Calibre. In Future we will provide HTML and PDF versions fore those that do not use ePuB readers much.
- Version: Fall2018
- Editors: Gregor von Laszewski, Geoffrey C. Fox, Fugang Wang
- Classes: E516, E416, B649
- Description: This document contains an Introduction to Cloud Computing augmented with many practical technical examples. It show cases how to use virtual machines, containers, and function as a Service. Common platforms such as Hadoop are also introduced. Most of the examples are provided in Python.
- Comments: This is the class that you want to take to learn about Cloud Computing in detail. It is a technical class, requiring you typically to do three assignments. A paper about a cloud technology, a project related to clouds.
- Version: Early alpha
- Editors: Gregor von Laszewski
- Classes: E516, E416, B649
- Description: This document contains an Introduction to Cloud Computing augmented with many practical technical examples specifically targeted for the Raspberry PI. PI’s are a good way to model cloud environments with real hardware. Most of the examples are provided in Python.
- Comments: This set of notes enhances the Cloud Computing notes while focussing on Raspberry Pis and IoT. As part of this your project will be to create a Raspberry PI Cloud cluster. Residential students have access to 200 Raspberry PI’s
- Version: Fall2018
- Editors: Geoffrey C. Fox, Gregor von Laszewski, Fugang Wang
- Classes: e534, I523, I423
- Description: This document contains an Introduction to Big dada and analytics. It focusses on a large number of applications.
- Comments: Although this class also is focussing on a project, Students have the ability to chose a report only option with reduced grade. This is an introductory class. The project is related to big data.
- Version: Fall2018
- Editors: Gregor von Laszewski
- Classes: e534, I523, I423
- Description: This collection contains a very large number of technologies related to Cloud Computing and Big Data. It serves as a catalog that introduces students to technologies so they can gain an overview of some technologies.
- Comment: This document is a supplemental document to our lecture notes and may help in conducting additional research on technologies students are interested in or could be building the computational framework for projects. The write-ups are provided by former students of the i523 and i524 classes.
- Version: Fall2018
- Editors: Gregor von Laszewski
- Classes: all
- Description: This document describes how to use some easy scientific writing skills to communicate your findings in a report format as required for the class. Topics include plagiarism, emacs, and markdown.
- Comment: Please check with your class. Do not underestimate the power of tools other than MSWord. Structured text writing is more than making your content look pretty. Cloud Computing and big data is naturally also about communicating your research. We teach you how.
- Version: Fall2018
- Editors: Gregor von Laszewski
- Classes: all
- Description: This document describes how to use some easy scientific writing skills and communicate them with the help of LaTeX.
- Comment: please check with your class if it is allowed to use LaTeX. LaTeX is actually trivial with our documentation. However, we had students in the past that made things unnecessarily complicated and also manipulated our default format. Due to this reason we started in Fall 2018 to experiment with markdown as replacement to LaTeX
We have a large list of bibtex resources. Please feel free to reuse them, However, please acknowledge us in your papers you write with them.
Link | Class | Description |
---|---|---|
bibtex
|
all | BibTeX files directory I as used in the books ePub |
bibtex
|
all | BibTeX files directory II as used in the Cloud Technologies ePub |
Link | Class | Description |
---|---|---|
Mooc
|
2015 | Big Data Applications and Analytics |
Mooc
|
2015 | GE Special Class |
Mooc
|
2014 | Big Data Open SOurce Software and Projects |
Mooc
|
2014 | Big Data Applications and Analytics II |
Mooc
|
2014 | Big Data Applications and Analytics (Fall) |
Mooc
|
2014 | Big Data Applications and Analytics (Spring) |
The following Proceedings summarize a technology in Cloud Computing or Big Data
- Version: Fall 2017
- Editors: Gregor von Laszewski
Classes: e534, I523, I423
Description: This document contains contributed 2 summary page papers by students of the class. This includes 62 papers in application areas including buisiness, energy, environment, government, health, machine learning, media, security, sports, transportation and theory in general. In addition we have several papers that discuss some of the technologies, including DevOps, Spark, HPC, Social Media, PaaS NoSQL, Aws, and Docker.
Comment: There may be some incomplete contributions in the proceedings, please ignore them.
- Version: Fall 2017
- Editors: Gregor von Laszewski
Classes: e534, I523, I423
Description: This document contains contributed 2 summary page papers by students of the class. This includes additional 62 papers in application areas including buisiness, energy, environment, government, health, machine learning, media, security, sports, transportation and theory in general. In addition we have several papers that discuss some of the technologies, including MQTT, NoSQL, AWS, Spark, MongoDB, Telemetry, IoT and others please ignore them.
Comment: There may be some incomplete contributions in the proceedings, please ignore them.
- Version: Spring 2017
- Editors: Gregor von Laszewski
- Classes: e516
- Description: This document contains contributed 2 summary page papers by students of the class. This includes about 45 papers with introductions to cloud and big data technologies. The topics include: Allegro Graph, Ansible, Apache Airavata , Apache Beam Google Cloud Dataflow , Apache Drill, Apache Lucene, Apache Samza, Apache Spark, Apache Sqoop, Azure Data Factory, Azure Machine Learning, CDAP Cask Data Application Platform, CUBRID RDBMS, CoreOS, Couchbase Server: A Usable Overview , Docker Container, Docker Machine, Docker Swarm, File Transfer Protocol, H2O, Flume, Globus Toolkit, Google Bigtable , Google Cloud storage, Google Dremel, Google Fusion Table, H-Store, HTCondor, HadoopDB, JMS, Kubernetes, LDAP, Lustre File System, MLlib, MongoDB, MySQL, Netty vs ZeroMQ in Realtime Analytics, OpenNebula Project, Oracle PGX, Pentaho, RabbitMQ, Tajo, Triana, Twister, Twitter, Xen: A bare metal hypervisor , vCloud and vSphere
- Comment: There may be some incomplete contributions in the proceedings, please ignore them.
- Version: Spring 2017
- Editors: Gregor von Laszewski
- Classes: e516
- Description: This document contains contributed 2 summary page papers by students of the class. This includes about 40 papers with introductions to cloud and big data technologies. The topics include: Juju, ASKALON, AWS Lambda, Amazon Elastic Beanstalk Shree, Amazon Kinesis, Apache Apex, Apache Avro, Apache Crunch, Apache Flink, Apache MRQL, Apache Mahout, Apache Ranger, Apache Spar, Apache Tez, Apache Thrift, Berkeley DB, Cisco Intelligent Automation, Dryad , Google BigQuery Google Cloud DNS, Graylog, HUBzero, Heroku, Hyper-V, InCommon, Jupyter Notebook vs Apache Zeppelin, KeystoneML, LMDB, Memcached, Naiad, Neo4J, OpenCV, OpenStack Nova, Pivotal HD/HAWQ , Pivotal Web Services, Retainable Evaluator Execution Framework, Robot Operating System (ROS), SciDB, Tao, Terraform, YARN,
- Comment: There may be some incomplete contributions in the proceedings, please ignore them.
- Version: Spring 2018
- Editors: Gregor von Laszewski
- Classes: e516
- Description: This document contains contributed 2 summary page papers by students of the class. This includes additional 40 papers with introductions to cloud and big data technologies. The topics include: AWS CloudTrail, Amazon AWS EC2, Amazon EMR, Amazon RDS, Amazon Redshift, Amazon S3, Apache CloudStack, Apache Flink , Apache Mesos and Mesosphere, Apache Milagro , Apache Spark, Azure Cosmos DB , Blockchain and Distrubuted Ledger Technology, Cloud AutoML, Conspectus of Firebase Cloud Services, Domo, ELK Stack, Google Compute Engine , GraphQL, HBase, HPCC Systems, Hadoop, Hadoop Distributed File System, Heroku Cloud Platform, IBM Big Replicate, IBM Cloud, Informatica Intelligent Cloud Services, Jaspersoft, KNIME - Konstanz Information Miner, Morpheus, Openchain, Oracle NoSQL, Orange, Power BI, Puppet, Real Time Stream Processing , SETI@Home, Security Mechanisms in Cloud Computing, Stardog, Synthetic Data Vault, TensorFlow, Twilio, XGBoost
- Comment: There may be some incomplete contributions in the proceedings, please ignore them.
- Version: Fall 2018 (alpha)
- Editors: Gregor von Laszewski
- Classes: e516, i523
Description: This is an early draft
Comment: There may be some incomplete contributions in the proceedings, please ignore them.
- Version: Spring 2018 (alpha)
- Editors: Gregor von Laszewski
- Classes: e516, i523
Description: This is an early draft
Comment: There may be some incomplete contributions in the proceedings, please ignore them.
Link | Class | Title |
---|---|---|
PDF
|
Fall 2017 | Use Cases in Big Data Software and Analytics Vol. 3, Gregor von Laszewski |
PDF
|
Spring 2017 | Big Data Projects, Vol. Spring 2018, Gregor von Laszewski |
PDF
|
Spring 2018 | Cloud and Big Data Projects, Vol 9, Gregor von Laszewski |
A command line interface to the use of comets virtual cluster interface.
A program to burn a fully operational abd configured image on an SD card for Raspberry PI clusters.
Big Data REST INterfaces to build generalized Big Data Architectures.
A multi-cloud GraphQL based interface.
A python interface to HPC batch systems.
A multi-cloud client interface for virtual machine management.
An ongoing reimplementation of the multi-cloud client interface for virtual machine management.
A Python module to easily create dynamic command shells from docopts enabled plugins.
Simple tool to manage a number of resources.
A reservation framework for resources that can be reserved via a REST servics.
A simple python library to execute shell commands on remote computers either in sequence or in parallel..
An virtual cluster deployment managed with Slurm to use it for batch processing in clouds.
“This is the best class I have taken …”
or
“I really enjoyed taking this class and having maximum flexibility to schedule the lectures. …”
or
“The lessons learned from this class were adopted within my company. …”
or
“I wanted to sincerely thank you for all the guidance you provided in this course. My learning in cloud computing has enhanced a lot because of this course and also because of your continuous guidance. … “
or
“Thanks to the material you thauhgt I got a job at Intel. …”