data science glossary - intotheminds

9

Upload: others

Post on 22-Nov-2021

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: data science glossary - IntoTheMinds
Page 2: data science glossary - IntoTheMinds

datascienceglossary1.NaturalLanguageProcessing

1.1.Tokenization1.2.Normalization1.3.Stemming1.4.Lemmatization1.5.Corpus1.6.StopWords1.7.Parts-of-speech(POS)Tagging1.8.StatisticalLanguageModeling1.9.BagofWords1.10.n-grams1.11.RegularExpressions1.12.Zipf'sLaw1.13.SimilarityMeasures1.14.SyntacticAnalysis1.15.SemanticAnalysis1.16.SentimentAnalysis1.17.InformationRetrieval

2.InternetofThings(IoT)2.1.6LoWPAN2.2.AdvancedEncryptionStandard(AES)2.3.ApplicationProgrammingInterface(API)2.4.BluetoothLowEnergy(BLE)2.5.EmbeddedSoftware

Page 3: data science glossary - IntoTheMinds

2.6.Gateway2.7.GeneralPacketRadioService(GPRS)2.8.Industrial,Scientific,andMedical(ISM)Band2.9.LinkBudget2.10.MachinetoMachine(M2M)2.11.MediaAccessControl(MAC)

3.PredictiveAnalytics3.1.PredictiveModel3.2.ArtificialIntelligence3.3.UpliftModel3.4.VastSearch3.5.AutomaticSuspectDiscovery(ASD)

4.Database4.1.RelationalDatabase4.2.DatabaseManagementSystem(DBMS)4.3.PrimaryKey4.4.ForeignKey4.5.StructuredQueryLanguage(SQL)4.6.NoSQL4.7.Metadata4.8.Consistency4.9.DataRedundancy4.10.ACID4.11.CAPTheorem4.12.Sharding4.13.Key-valueStore

Page 4: data science glossary - IntoTheMinds

4.14.DocumentStore4.15.Column-orientedDatabase4.16.GraphDatabase

5.Clustering5.1.FeatureSelection5.2.ExpectationMaximization(EM)5.3.Distance-basedMethods5.4.Density-andGrid-BasedMethods5.5.MatrixFactorization5.6.SpectralMethods5.7.Graph-basedTechniques5.8.Streamingscenario

6.BigData6.1.BigDataVolume6.2.BigDataVelocity6.3.BigDataVariety6.4.BigDataVeracity6.5.BigDataVariability6.6.BigDataValue6.7.PredictiveAnalytics6.8.DescriptiveAnalytics6.9.PrescriptiveAnalytics6.10.Database6.11.DataWarehouse6.12.ETL6.13.BusinessIntelligence

Page 5: data science glossary - IntoTheMinds

6.14.ApacheHadoop6.15.ApacheSpark6.16.Datalake6.17.Datamining6.18.Datapreparation6.19.Datavault6.20.Datamunging6.21.Datawrangling6.22.Datagovernance6.23.Datastewardship6.24.Datavisualization6.25.DataStorytelling

7.MachineLearning7.1.Classification7.2.Regression7.3.Clustering7.4.Association7.5.DecisionTrees7.6.SupportVectorMachines7.7.NeuralNetworks7.8.DeepLearning7.9.ReinforcementLearning7.10.(k-fold)Cross-validation7.11.Bayesian7.12.RandomForest

8.deeplearning

Page 6: data science glossary - IntoTheMinds

8.1.ArtificialNeuralNetworks(ANNs)8.2.BiologicalNeuron8.3.Perceptron8.4.MultilayerPerceptron(MLP)8.5.FeedforwardNeuralNetwork8.6.RecurrentNeuralNetwork8.7.ActivationFunction8.8.Backpropagation8.9.CostFunction8.10.GradientDescent8.11.VanishingGradientProblem8.12.ConvolutionalNeuralNetwork8.13.LongShortTermMemoryNetwork(LSTM)

9.DescriptiveStatistics9.1.Population9.2.Sample9.3.Parameter9.4.Statistic9.5.Generalizability9.6.Distribution9.7.Mean9.8.Median9.9.Mode9.10.Skew9.11.Range9.12.Variance

Page 7: data science glossary - IntoTheMinds

9.13.StandardDeviation9.14.InterquartileRange(IQR)

10.CloudComputing10.1.XaaS(Anything-as-a-Service)10.2.Software-as-a-Service(SaaS)10.3.Platform-as-a-Service(PaaS)10.4.Infrastructure-as-a-Service(IaaS)10.5.PublicCloud10.6.PrivateCloud10.7.HybridCloud10.8.AWS10.9.AmazonEC2(ElasticCloudCompute)10.10.AmazonSimpleStorageService(S3)10.11.CloudSourcing10.12.ConsumerCloud10.13.Multi-tenancy10.14.VerticalCloud10.15.CloudPortability10.16.CloudBackup10.17.CloudEnablement10.18.CloudMigration10.19.Cloudstorming10.20.CloudBroker

11.Hadoop11.1.MapReduce11.2.HadoopDistributedFileSystem(HDFS)

Page 8: data science glossary - IntoTheMinds

11.3.YetAnotherResourceNegotiator(YARN)11.4.HBase11.5.Hive11.6.ApachePig11.7.ApacheSpark11.8.Sqoop11.9.Oozie11.10.ZooKeeper11.11.ApacheFlume11.12.Hue11.13.Mahout11.14.Ambari11.15.HadoopCommon

12.Apache12.1.RDD12.2.DataFrame12.3.Dataset12.4.MLlib12.5.MLPipelines12.6.GraphX12.7.SparkStreaming12.8.StructuredStreaming12.9.spark-packages.org12.10.CatalystOptimizer12.11.Tungsten12.12.ContinuousApplications

Page 9: data science glossary - IntoTheMinds

12.13.In-memorycomputing