top questions for evaluating hadoop platform vendors

3
Memo from Analytix.is 1 Top Questions for Evaluating Hadoop Platform Vendors Category Requirement Strategic Fit Market Presence: What was your CY2014 worldwide revenue from Big Data solutions? Strategic Fit Number and Quality of References: Please provide 10 customers in our industry globally. At least 5 of these must be willing to entertain a reference check Strategic Fit Hadoop Expertise: How long have your technical leaders worked with Apache Hadoop? Strategic Fit Apache Leadership: How many of your engineers are Apache Software Foundation (ASF) committers and PMC members, for relevant Hadoop and Spark projects? Strategic Fit Open Source Philosophy: List components of your distribution that are governed by projects within the ASF and which are not. Strategic Fit Differentiators: How is your solution differentiated visavis your competitors? Please name the specific competitor products in any comparison you provide Strategic Fit EDW Integration: Please describe how queries with predicate pushdown can be initiated on the existing EDW and the execute in distributed but coordinated fashion across both Hadoop and the EDW. Please list any certifications in that respect Strategic Fit Modular Architecture based on open standards: Please provide evidence that proprietary components are absent or widely supported by an ecosystem with clearly specified APIs; any evidence that your solution is not a black box and has no vendor lockin Strategic Fit Roadmap: Please provide as much clarity as possible on your roadmap. What is the anticipated release date and new feature list for each of the product(s) component(s) over the next 12 months? Strategic Fit Technology Partnerships / Product Extendability: Please list which relevant thirdparty software in areas like visualization, application development, machine learning, data science, security is certified on your distribution Strategic Fit Analytic App Dev: What functionality enables analytic application development, such as wellmaintained APIs, programming frameworks in common languages, an integrated developer environment / SDK Strategic Fit Visualization: Access from existing reporting & visualization tools (QLIK, BOBJ Explorer, Tableau). Please list which are certified. Strategic Fit Industryspecific IP: Please list outofthebox functionality for specific use cases in our industry Strategic Fit Service Intensity: For typical deployments at your customers (please state parameters indicating scope / size of deployment), how much professional services FTE days are required for a) the deployment, b) ongoing operations

Upload: juergenurbanski

Post on 21-Nov-2015

32 views

Category:

Documents


0 download

DESCRIPTION

Top Questions for Evaluating Hadoop Platform Vendors

TRANSCRIPT

  • Memo from Analytix.is

    1

    Top Questions for Evaluating Hadoop Platform Vendors Category Requirement Strategic Fit Market Presence: What was your CY2014 worldwide revenue from Big Data solutions? Strategic Fit Number and Quality of References: Please provide 10 customers in our industry globally. At least 5 of these must be willing to entertain a reference check Strategic Fit Hadoop Expertise: How long have your technical leaders worked with Apache Hadoop? Strategic Fit Apache Leadership: How many of your engineers are Apache Software Foundation (ASF) committers and PMC members, for relevant Hadoop and Spark projects? Strategic Fit Open Source Philosophy: List components of your distribution that are governed by projects within the ASF and which are not. Strategic Fit Differentiators: How is your solution differentiated vis-a-vis your competitors? Please name the specific competitor products in any comparison you provide Strategic Fit EDW Integration: Please describe how queries with predicate pushdown can be initiated on the existing EDW and the execute in distributed but coordinated fashion across both Hadoop and the EDW. Please list any certifications in that respect Strategic Fit Modular Architecture based on open standards: Please provide evidence that proprietary components are absent or widely supported by an ecosystem with clearly specified APIs; any evidence that your solution is not a black box and has no vendor lock-in Strategic Fit Roadmap: Please provide as much clarity as possible on your roadmap. What is the anticipated release date and new feature list for each of the product(s) component(s) over the next 12 months? Strategic Fit Technology Partnerships / Product Extendability: Please list which relevant third-party software in areas like visualization, application development, machine learning, data science, security is certified on your distribution Strategic Fit Analytic App Dev: What functionality enables analytic application development, such as well-maintained APIs, programming frameworks in common languages, an integrated developer environment / SDK Strategic Fit Visualization: Access from existing reporting & visualization tools (QLIK, BOBJ Explorer, Tableau). Please list which are certified. Strategic Fit Industry-specific IP: Please list out-of-the-box functionality for specific use cases in our industry Strategic Fit Service Intensity: For typical deployments at your customers (please state parameters indicating scope / size of deployment), how much professional services FTE days are required for a) the deployment, b) ongoing operations

  • Memo from Analytix.is

    2

    Strategic Fit PS Ecosystem: Please describe the breadth and depth of your Professional Services Ecosystem (listing number of partners and respective number of certified staff) Strategic Fit Testing: Describe your testing and certification process, prior to releasing a new version of your distribution Strategic Fit Switching Costs: Can we use the ASF community to self-support your solution? How quickly do ASF fixes make it into the software? Commercial Needs Speed of deployment: ability to support a pilot in 1 month, full production in 3 months. E.g., can some solution modules be downloaded, pilot data ingested, staff trained while commercials are still being worked on? Commercial Needs Support capability 24x7 in English including SPOC for L1 calls Commercial Needs Training: Please list your publicly available training & certification offerings Commercial Needs Flexible pricing structure that enables extensions of capacity and capability in modular increments. Explain your pricing structure and list price levels. Commercial Needs Data Volume: Describe how performance and storage pricing changes at different amounts of data. Data Management High Availability: What HA features do you provide to minimize data center outages? Does it support zero downtime? Data Management Portability & Deployment: What operating system(s) do you support? Can we deploy in the cloud (e.g., for test/dev)? Data Management Heterogeneous Infrastructure: How does your Hadoop solution deal with disparate hardware pools in one data center (e.g., newer hardware next to older hardware; memory vs. storage optimized) Data Access Extensibility: Would we be able to add new data applications that run in YARN? What YARN-ready ISV apps run on your platform? Data Access Use Cases: Do you have customers in our industry that use your distribution? Please describe the scope and use cases for up to 10 customers in our industry Data Access Processing Engines: Can we simultaneously process data residing in the same multi-purpose cluster with many different processing engines? Which big data relevant processing engines does your offering include? Our needs range across batch, interactive, real-time, streaming, search, machine learning, rules engine, recommendation engine, and data science Data Access Existing Skills: Will our analysts be able to use their existing SQL skills to query data in Hadoop? Please list the SQL instruction set supported by your distribution Integration & Governance Data Ingest: How would you move our raw data sources into Hadoop? What specific connectors do you provide for common telco network data sources / vendors? Please describe how your solution supports a wide range of ingestion capabilities

  • Memo from Analytix.is

    3

    incl. NFS access to HDFS Integration & Governance Replication and Retention: How will we be able to set centralized policies for data replication and retention? Integration & Governance Pipeline Monitoring: How do we gain insight into source lineage of data in Hadoop? Can we set explicit policies for data flows? Security Administration: Do you provide central administration of security policy within the cluster? How do you coordinate enforcement across workloads? Security Authentication: How does your solution verify the identity of users and systems accessing the cluster? Please describe how you achieve granular role-based access control via AD, LDAP, Kerberos, Federated Identity, etc. Security Authorization: How do you provide fine-grain authorization to access data in the cluster? Security Audit: How will we be able to audit the actions taken by individual users? Please describe your auditing capabilities to track changes to configurations and data access (MR jobs, REST API, etc.). Provide a list of activities or events, with corresponding information and attributes that are logged. Security Multi-Tenancy Internal: Please describe how you achieve internal multi-tenancy: Tenant, data, network and namespace separation in all services Security Multi-Tenancy External: Please describe how you achieve external multi-tenancy: Support of tenants outside of our company served by the same physical data lake instance that serves internal tenants Operations Provision: What operating system(s) do you support? What are our cloud/on-premises deployment options? Operations Manage: What tools do you provide for the ongoing management of a Hadoop cluster? If GUI, please include screen shots Operations Monitor: Do you provide a single pane of glass to monitor all cluster components, with respect to performance, availability, and other runtime characteristics? Operations Extend: Can your management tool integrate with existing operations consoles? In particular HP Openview, IBM and Teradata Viewpoint