SmartDataCollective

  • IEEE Big Data Conference 2017 to Highlight Challenges, Opportunities
    Since 2013, the Institute of Electrical and Electronics Engineers has held annual big data conferences to highlight changes and opportunities in a rapidly growing field. The IEEE Big Data Conference of 2017 is scheduled to be held from December 11-14…
    - 3 days ago 23 Jun 17, 12:15pm -
  • 10 of the Top Marketing BI Software Options
    Business can be complicated sometimes. It’s not always easy to keep track of all the data and information we deal with, and with so much of what we do now is conducted digitally, it can get frustrating trying to track everything in an Excel spreads…
    - 3 days ago 23 Jun 17, 12:00am -
  • The Race for 5G Is the Race for Data Dominance
    Have you noticed how often the phrase “by the year 2020” comes up? In the tech sphere, many are heralding 2020 as the mile marker for various innovations. One central game-changer in this respect is 5G. 5G is the fifth generation network that wil…
    - 4 days ago 22 Jun 17, 7:45pm -
  • Using Big Data to Anticipate and Prepare for Life Disruptions
    Big data is helping people plan for unexpected interruptions in life. Although more predictive analytics models are developed for businesses, they can be used by everyday people as well. Consumers are using big data to deal with unplanned financial p…
    - 5 days ago 21 Jun 17, 2:08pm -
  • The Direst Security Breaches of 2017 and How Data Centers Are Responding
    Cybersecurity is becoming a tremendous concern. By 2021, security breach cost will exceed $6 trillion a year. A number of reset security breaches show these predictions may actually be too conservative. Worst Security Breaches of 2017 Cybersecurity a…
    - 5 days ago 20 Jun 17, 9:34pm -

AirBnB - Nerds

  • Writing fast, deterministic and accurate Android Integration tests
    Introducing OkReplay — record and replay OkHttp network interaction in your tests.At Airbnb, shipping a high quality product is of utmost importance. To accomplish this goal, we use automated testing to catch bugs before they reach our users.…
    - 4 days ago 21 Jun 17, 11:18pm -
  • Unlocking Test Performance — Migrating from Mocha to Jest
    Overview: Airbnb migrated from Mocha to Jest. Running our test suite with Mocha took 12+ minutes. In CI with our beefy build machines (32 cores) we’re able to run the entire Jest suite in 4 minutes 30 seconds.We’d been using Mocha at Airbnb sinc…
    - 11 days ago 15 Jun 17, 8:20pm -
  • How Airbnb Democratizes Data Science With Data University
    By Jeff Feng, Erin Coffman & Elena GrewalIntroductionData is essential to us at Airbnb. We characterize data as the voice of our users at scale. Thus, data science plays the role of an interpreter — we use data and statistics to understand our…
    - 23 days ago 3 Jun 17, 4:26am -
  • Selection Bias in Online Experimentation
    A Method for Winner’s Curse in A/B TestingOverview: In online experimentation platforms, we choose the experiments with significant successful results to launch to the product. When estimating the aggregated impact of the launched features, we inv…
    - 27 days ago 30 May 17, 4:40pm -
  • Rearchitecting Airbnb’s Frontend
    Overview: We recently rethought the architecture for the JavaScript side of our codebase at Airbnb. This post will look at (1) the product drivers that precipitated the changes, (2) the steps we took to move away from our legacy Rails solutions, and…
    - 41 days ago 16 May 17, 4:45pm -

Data Center Knowledge

  • Top Five Data Center Stories – Week of June 19
    Here are the most popular stories that appeared on Data Center Knowledge this week: Vapor IO to Sell Data Center Colocation Services at Cell Towers – Expecting development of the Internet of Things to drive demand for edge data centers that aggreg…
    - 1 day ago 24 Jun 17, 9:30pm -
  • Energy Department Awards $258 Million to Develop Exascale Supercomputers
    The Department of Energy (DOE) has awarded $258 million to six U.S. tech companies to build the country’s first exascale supercomputer – a move designed to help the United States regain its supercomputing dominance, but also to improve the nation…
    - 2 days ago 23 Jun 17, 8:37pm -
  • Planning for the New Windows Server Cadence
    The next version of Windows Server will let you run Linux containers using Hyper-V isolation (and connect to them with bash scripts), encrypt network segments on software-defined networks and deploy the Host Guardian Service as a Shielded VM rather t…
    - 3 days ago 23 Jun 17, 8:21pm -
  • Google Will Stop Reading Your Emails for Gmail Ads
    Google is stopping one of the most controversial advertising formats. Read More
    - 3 days ago 23 Jun 17, 4:54pm -
  • Packet, Qualcomm to Host World’s First 10nm Server Processor in Public Cloud for Developers
    Packet, a bare metal cloud for developers, announced that it will collaborate with Qualcomm Datacenter Technologies, Inc. to introduce the latest in server architecture innovation on the 48-core Qualcomm Centriq 2400 processor. The New York City-bas…
    - 3 days ago 23 Jun 17, 3:21pm -

The Unofficial Google Data Science Blog

  • Our quest for robust time series forecasting at scale
    by ERIC TASSONE, FARZAN ROHANIWe were part of a team of data scientists in Search Infrastructure at Google that took on the task of developing robust and automatic large-scale time series forecasting for our organization. In this post, we recount how…
    - 69 days ago 18 Apr 17, 12:02am -
  • Attributing a deep network’s prediction to its input features
    By MUKUND SUNDARARAJAN, ANKUR TALY, QIQI YANEditor's note: Causal inference is central to answering questions in science, engineering and business and hence the topic has received particular attention on this blog. Typically, causal inference in data…
    - 13 Mar 17, 8:54pm -
  • Causality in machine learning
    By OMKAR MURALIDHARAN, NIALL CARDIN, TODD PHILLIPS, AMIR NAJMIGiven recent advances and interest in machine learning, those of us with traditional statistical training have had occasion to ponder the similarities and differences between the fields. M…
    - 1 Feb 17, 1:55am -
  • Practical advice for analysis of large, complex data sets
    By PATRICK RILEYFor a number of years, I led the data science team for Google Search logs. We were often asked to make sense of confusing results, measure new phenomena from logged behavior, validate analyses done by others, and interpret metrics of…
    - 1 Nov 16, 3:18am -
  • Statistics for Google Sheets
    By STEVEN L. SCOTTBig data is new and exciting, but there are still lots of small data problems in the world. Many people who are just becoming aware that they need to work with data are finding that they lack the tools to do so. The statistics app f…
    - 30 Sep 16, 4:06pm -

IBM BigData and Analytics Hub

  • Leveraging event-driven systems for IoT with high-speed data ingestion
    It seems that we’re reaching the point where the Internet of Things (IoT) is moving from the domain of enthusiastic early-adopters to the more challenging, more profitable territory of mainstream enterprise technology. Event-driven architectures ar…
    - 4 days ago 22 Jun 17, 1:53pm -
  • Empowering a new generation of developers with enterprise-class databases
    If you read a lot of development blogs nowadays, you’ll probably notice a common theme: developers don’t want to deal with databases. They want to focus on designing, building, testing, and deploying applications that deliver value to the busines…
    - 4 days ago 22 Jun 17, 1:36pm -
  • How to take the next step in Information Governance now
    A decade ago, governance was dictated and enacted by a select group of people. Today, while the principles of governance are largely owned by the same select group of people, everyone has a hand and shared responsibility in the enactment and fulfillm…
    - 4 days ago 22 Jun 17, 1:00pm -
  • Overcome your data silos: Learn how in Munich
    Big data isn’t just getting bigger. It’s getting more valuable. As companies work to unlock more value from their data, one of the biggest challenges to address is disconnected data silos. Big companies don’t have one data lake, they have data…
    - 5 days ago 21 Jun 17, 6:34pm -
  • Start Test Driving Data Governance
    No matter what site you search, it’s pretty clear that self service data is a top trend in the data market today. The knowledge and insight that we can obtain from data is truly a secret weapon. But the challenge is making the data available while…
    - 5 days ago 20 Jun 17, 8:36pm -

Predictive Analytics