Return to flip book view

Data Archiving in the PremCloud

Page 1

Data Archiving in the Prem Cloud Presented by Capax Discovery 2018 Why is data so valuable What is big data What is data archiving What is Capax Archive Solutions 3 0 What is the Prem Cloud In this age of technology driven enlightenment data is our competitive currency Buried within raw information generated in mind boggling volumes by transactional systems social media search engines and countless other technologies are critical strategic customer and operational insights that once illuminated by analytics can validate or clarify assumptions inform decision making and help chart new paths to the future Deloitte Insights CAS 3 0 has been built to archive data better than any other solution in the world Our unified archiving program provides the discerning aggregation secure archiving and total access to all of an organization s structured unstructured and dark data But more importantly our solution goes beyond powerful archiving to give our clients total control and ownership of all their data in our various PremCloud Hosting environments Tom Thomson CEO Capax Discovery

Page 2

Why is data so valuable In the vortex of the information age data is king Data has now eclipsed both gold and oil as the most valuable asset in the world And data is becoming more valuable every day The amount of data the world produces is growing at a staggering rate doubling every two years and big data is leading the charge Smartphones and the internet have made data abundant ubiquitous and far more valuable This enormous increase in the amount of data being created collected stored and archived by organizations around the world is unrelenting and the ability to access and analyze this data is quickly becoming critically important and the organizations that manage and utilize this massive amount of available data most adroitly will undoubtedly have a significant competitive advantage Bolstered by technological advancements in Big Data and analytics organizations are now entering a challenging competitive environment where enterprise data is recognized and utilized as an asset capable of transforming the way they operate and profitize information But as valuable as big data is today the future promises to utilize current and legacy data more effectively than anything even remotely possible today Quantum computing is expected to be able to search very large unsorted data sets to uncover patterns or anomalies extremely quickly It might be possible for the quantum computers to access all items in your database at the same time to identify these similarities in seconds While this is theoretically possible today it only happens with a parallel computer looking at every record one after another so it takes an incredible amount of time and depending on the size of the data set it might never happen The promise is that quantum computers will allow for quick analysis and integration of our enormous data sets which will improve and transform our machine learning and artificial intelligence capabilities Bernard Marr Forbes But data s value transcends business it has an impact on every aspect of our lives In healthcare data provides personalized medicine and prescriptive analytics as well as automated external and internal reporting of patient data And for all of our critical infrastructure categories such as energy government transportation travel and communications data is transforming how they operate as well as putting them all at escalating risk Data is no longer the domain of tech companies or IT departments it is fast becoming a centerpiece of corporate value creation more generally Today most organizations are data driven to one degree or another Data contributes not only to brand equity but to what constitutes product and service delivery in globally connected and hyper competitive markets Failure to accurately quantify the enterprise value of data EvD may therefore woefully undervalue the importance of cyber security investments as well as the face values typically applied to cyber insurance policies Dante Disparte and Daniel Wagner The question this data revolution presents isn t whether or not data is critically valuable but who will utilize data most effectively

Page 3

What is big data It s undeniable how fast corporate data is growing There seems to be no limit to not just this idea of big data but to the growth rate of it as well Most charts show accelerating growth typically in an exponential fashion And these massive collections contain large amounts of dark structured and unstructured data What does all of this mean Big data sets are so voluminous and complex that traditional data processing application software is inadequate to deal with them Big data challenges include capturing data data storage data analysis search sharing transfer visualization querying updating and information privacy Data sets grow rapidly because they are increasingly gathered by numerous information sensing Internet of Things devices such as mobile devices software logs cameras microphones radio frequency identification RFID readers and wireless sensor networks IDC estimates that by 2020 business transactions on the internet business to business and business to consumer will reach 450 billion per day More than 5 billion people are calling texting tweeting and browsing on mobile phones worldwide Big data is a top business priority and drives enormous opportunity for business improvement but data concerns are escalating because in part the growth of data is exceeding the capacity of traditional computing The question every organization now faces to varying degree is how do we effectively consume those data sources and transform them into actionable information In a recent report Gartner estimates that 20 of enterprise data is mission critical 30 is redundant and 50 is of indeterminate value Leading companies embed analytics into their organizations by resolving to be data driven and defining what they hope to accomplish through their use of Big Data The CEO and top leadership team need to describe how analytics will shape the business s performance whether by improving existing products and services optimizing internal processes building new products or service offerings or transforming business models Top performing organizations do this well often building their organizations around data and a commitment to make data driven decisions Bain In the context of big data dark describes something that is hidden or undigested According to Gartner dark data is the information assets organizations collect process and store during regular business activities but generally fail to use for other purposes Dark analytics focuses primarily on raw text based data that has not been analyzed with an emphasis on unstructured data which may include things such as text messages documents email video and audio files and still images In some cases dark analytics explorations could also target the deep web which comprises everything online that is not indexed by search engines including a small subset of anonymous inaccessible sites known as the dark web It is impossible to accurately calculate the deep web s size but by some estimates it is 500 times larger than the surface web that most people search daily

Page 4

Across enterprises ever expanding stores of data remain unstructured and unanalyzed Few organizations have been able to explore non traditional data sources such as image audio and video files the torrent of machine and sensor information generated by the Internet of Things and the enormous troves of raw data found in the unexplored recesses of the deep web However recent advances in computer vision pattern recognition and cognitive analytics are making it possible for companies to shine a light on these untapped sources and derive insights that lead to better experiences and decision making across the business Deloitte Insights It is generally considered that as more advanced computing systems for analysis of data are built the higher the value of dark data will be It has been noted that data and analytics will be the foundation of the modern industrial revolution Of course this includes data that is currently considered dark data since there are not enough resources to process it to meet the demand Furthermore many organisations do not realise the value of dark data right now Today s businesses are used to working with structured data It s information housed in databases and spreadsheets usually numbers that can be easily organized in rows and columns Because it s in a structured format it can be sliced diced and configured to create actionable insight Unstructured data on the other hand is information that exists outside of databases and spreadsheets It s embedded in disparate sources including web pages PDFs documents emails social media etc Often it s in a text format with a lot of language variances making it difficult to compile process or analyze using traditional IT algorithms So it leaves executives in the dark and unable to make business decisions Unfortunately today s businesses have much more unstructured data than structured data It s estimated that unstructured data accounts for over 80 percent of all business data And given trends in data proliferation it s projected to grow to nearly 95 percent by 2020 This means even that most data driven companies are only working with a small fraction of their business critical information They re essentially sitting on a treasure trove of untapped insight Sanjay Srivastava Today 80 percent of data comes from previously untapped unstructured information from the web such as imagery social media channels news feeds emails journals blogs images sounds and videos The key to making sure that data doesn t become dark is to ensure that it is properly categorized the moment it is generated or arrives in your province It s time to light up dark data This notion of unstructured data and the ability to light up this dark data the time for that is now The ability to tap into it and mine it and use it for decision making is becoming mainstream It s becoming table stakes Paul Roma Organizations retain dark data for a multitude of reasons and it is estimated that most companies are only analyzing 1 of their data Often it is stored for regulatory compliance and record keeping Some organizations believe that dark data could be useful to them in the future once they have acquired better analytic and business intelligence technology to process the information

Page 5

What is Data Archiving As organizations strive to aggregate and utilize their structured unstructured and dark data unified archiving has become essential Data can be classified as hot warm or cold Currently utilized data hot is processed in parallel in memory databases while much larger volumes of less frequently accessed data warm and archived data cold reside in various platforms While data archiving is commonly viewed as the process of moving data that is no longer actively used to a separate storage device for long term retention today s big data archiving demands a more robust process that includes the discerning aggregation secure archiving and total access of massive of amounts of data that will undoubtedly prove useful and invaluable in the future Archived data also consists of all the data that must be retained for regulatory compliance and discovery Data storage compliance and corporate governance regulations are having a tremendous impact on the practices employed to retain search certify and destroy data Storage compliance regulations typically focus on three distinct areas of interest retention integrity and security It s not just the major regulations like the Sarbanes Oxley Act SOX or the Health Insurance Portability and Accountability Act HIPAA which influence storage there are well over 10 000 regulations that affect data storage backup and protection across a range of industries Organizations are also viewed as custodians of data they collect so any breach of data especially sensitive and confidential data can result in significant losses and exposure for damages Data archives are indexed and have search capabilities so files and parts of files can be easily located and retrieved Search tools are actually used to locate the data for retrieval Searches can utilize the metadata indexes or even look inside some files to perform deeper contextual searches of file content However if data aggregation is done by separately in the same organization there may be cases where they do not interact with each other because of technological constraints This prevents bringing all the data together in order to create a cohesive picture This happens especially for organizations that have different systems and formats Due to increasing big data aggregation organizations are now recognizing that their existing infrastructure is unintentionally creating data sprawl This inability to meet the needs of new big data analytics is driving organizations toward consolidation with unified archiving Identifying classifying categorizing indexing archiving securing and effectively viewing and accessing this burgeoning amount of data is more challenging and riskier than ever and no one provides a better archiving solution than Capax Archive Solutions 3 0 Capax Archive Solutions 3 0 has been built to archive data better than any other solution in the world Our unified archiving program provides the discerning aggregation secure archiving and total access to all of an organization s structured unstructured and dark data But more importantly our solution goes beyond powerful archiving to give our clients total control and ownership of all their data in our various PremCloud Hosting environments

Page 6

What is Capax Archive Solutions 3 0 Capax Archive Solutions 3 0 automatically captures indexes and archives messages and information based on corporate policy enabling our clients to archive retain manage view access and retrieve important email messages files and dark data It also ensures that our clients are prepared to efficiently cost effectively and accurately respond to discovery requests for archived data whether for regulatory or litigation purposes CAS 3 0 delivers industry leading information governance with comprehensive and easy to use e discovery features With updated capabilities that meet today s litigation and regulatory obligations data from across your organization can be quickly searched preserved reviewed and exported But most importantly CAS 3 0 provides our clients with total access to their archived legacy data for future use as more robust utilization of their big data becomes available Our entire focus at Capax Discovery is on building state of the art data aggregation tools and archiving solutions in our PremCloud substantiating and elevating our prowess as data archivists Recently we announced the launch of our new Microsoft Office 365 tool CAS O Bridge Originally deployed in beta format in June 2017 our new CAS O Bridge establishes us as the industry leader in Microsoft Office 365 integration with a first of its kind bi directional integration that delivers evolved control governance and value CAS 3 0 builds on this ground breaking technology by mobilizing the entire CAS platform to stratify any deployment model CAS 3 0 can now be on premise hybrid hosted or entirely managed with Capax Discovery s new PremCloud TM End users can now access email files and all other archived data types through any supported browser on all mainstream operating systems and device types As the most trusted and secure archiving software solution on the market CAS 3 0 is backed by our world class services and technical support as well as our expertise in migration consulting and managed services Capax Discovery continues to provide our clients reduced information governance costs and risks while maximizing data value Capax Managed Services provides the services required for our clients to effectively and securely manage their most valuable asset their data When it comes to providing our world class Capax Archive Solutions the deciding factor in its efficacy relies on the contribution of our solution engineering professional services maintenance and support teams that assure secure and reliable performance Capax Managed Services answers the call for all of our clients needs with a speed to performance that exceeds all industry standards It is Capax Managed Services main objective to make certain that all of our Capax Archive Solutions are effectively implemented and that they are operating at their maximum performance capabilities for all of our clients In the end optimal functionality for our clients is essential to everyone s success Our Capax Discovery Professional Services team will also perform migrations project based software development and custom consulting engagements for clients around the globe

Page 7

What is the Prem Cloud The days of relying solely on legacy data centers are over The new data center landscape is largely virtualized and spread across multiple geographically distributed sites and public private and hybrid cloud environments But building connecting and securing this array of computing power is a complex job and one that requires a new type of data center network starting with an architecture optimized for the cloud era and an infrastructure that gives our clients agility automation and simplicity But simply choosing a cloud solution for unified archiving leaves our multinational clients at perilous risk and more importantly without full ownership and control of their data Fortunately Capax Discovery has eliminated the disadvantage of having to choose between on premise and cloud hosted data by presenting a more nimble paradigm for the location of data and the software solutions we offer Our new PremCloud TM Hosting perspective speaks directly to the strategic versatility of our hosting alternatives that we can now provide our clients especially our multinational clients Our customer s data storage needs can no longer be satisfied by the limitation of simply choosing between a cloud hosted or on premise solution there is a need for a more adaptable hosting alternative and that solution is in our PremCloud TM Capax Discovery s ground breaking PremCloud TM provides adaptable data archiving solutions with managed services for our accounts worldwide regardless of where their data resides in our PremCloud In our PremCloud our clients can rely on our software and Capax Managed Services for the archiving security management and performance of their valuable and ever increasing data Every multinational enterprise is bound to have a variety of native domicile conditions to deal with Sovereignty dominion residency accessibility laws restrictions custody compliance costs risk intellectual resources security safety and manageability will undoubtedly differ in each location If one of our customers were to be an international enterprise they would undoubtedly require different hosting environments for their archived data For example in Germany they may want to host their data in their own data center and have us provide the appropriate managed services for our CAS 3 0 solution while in Hong Kong they may want us to host it for them as well And in Canada they may want to host it but have us fully manage it And on and on The key is that our customers always truly own their data while they can rely on us for the security management and administrative optic that our market leading Prem Cloud TM based solutions provide And what CAS 3 0 and our newest apps can do with data for our enterprise clients is truly magic and that magic starts with Prem Cloud Hosting TM Many organizations share a vision of having all workloads analytics and applications running on a common dataset or a scalable multi tenant platform for all data and analytics workloads This movement begins with consolidating data PremCloud TM storage is a service model in which data is maintained managed backed up remotely and made available to users over a network through our PremCloud TM dashboard Our Prem Cloud TM Total Access administrative optic with CISO Sentinel allows our customers total access to their data as well as the governance of the entire processing of data coming in and going out of their archiving vault This allows for not only a better administrative optic but it also allows for a more robust use of their valuable archived data for a limitless amount of utilization

Page 8

Original sketch of the Prem Cloud 2017