데이터 과학 기반의 파이썬 빅데이터 분석 Chapter08 텍스트 빈도 분석

2023. 1. 8. 23:49Python/데이터 과학 기반의 파이썬 빅데이터 분석(한빛 아카데미)

01 [영문 분석 + 워드클라우드]

영문 문서 제목의 키워드 분석하기

!pip install matplotlib
!pip install wordcloud
!pip install nltk
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: matplotlib in /usr/local/lib/python3.8/dist-packages (3.2.2)
Requirement already satisfied: numpy>=1.11 in /usr/local/lib/python3.8/dist-packages (from matplotlib) (1.21.6)
Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /usr/local/lib/python3.8/dist-packages (from matplotlib) (3.0.9)
Requirement already satisfied: python-dateutil>=2.1 in /usr/local/lib/python3.8/dist-packages (from matplotlib) (2.8.2)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.8/dist-packages (from matplotlib) (0.11.0)
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.8/dist-packages (from matplotlib) (1.4.4)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.8/dist-packages (from python-dateutil>=2.1->matplotlib) (1.15.0)
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: wordcloud in /usr/local/lib/python3.8/dist-packages (1.8.2.2)
Requirement already satisfied: numpy>=1.6.1 in /usr/local/lib/python3.8/dist-packages (from wordcloud) (1.21.6)
Requirement already satisfied: matplotlib in /usr/local/lib/python3.8/dist-packages (from wordcloud) (3.2.2)
Requirement already satisfied: pillow in /usr/local/lib/python3.8/dist-packages (from wordcloud) (7.1.2)
Requirement already satisfied: python-dateutil>=2.1 in /usr/local/lib/python3.8/dist-packages (from matplotlib->wordcloud) (2.8.2)
Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.8/dist-packages (from matplotlib->wordcloud) (1.4.4)
Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /usr/local/lib/python3.8/dist-packages (from matplotlib->wordcloud) (3.0.9)
Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.8/dist-packages (from matplotlib->wordcloud) (0.11.0)
Requirement already satisfied: six>=1.5 in /usr/local/lib/python3.8/dist-packages (from python-dateutil>=2.1->matplotlib->wordcloud) (1.15.0)
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: nltk in /usr/local/lib/python3.8/dist-packages (3.7)
Requirement already satisfied: click in /usr/local/lib/python3.8/dist-packages (from nltk) (7.1.2)
Requirement already satisfied: regex>=2021.8.3 in /usr/local/lib/python3.8/dist-packages (from nltk) (2022.6.2)
Requirement already satisfied: tqdm in /usr/local/lib/python3.8/dist-packages (from nltk) (4.64.1)
Requirement already satisfied: joblib in /usr/local/lib/python3.8/dist-packages (from nltk) (1.2.0)

import nltk
nltk.download('stopwords')
[nltk_data] Downloading package stopwords to /root/nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
True

import nltk
nltk.download('punkt')
[nltk_data] Downloading package stopwords to /root/nltk_data...
[nltk_data]   Package stopwords is already up-to-date!
True

import nltk
nltk.download('wordnet')
[nltk_data] Downloading package wordnet to /root/nltk_data...
[nltk_data]   Package wordnet is already up-to-date!
True

import nltk
nltk.download('omw-1.4')
[nltk_data] Downloading package omw-1.4 to /root/nltk_data...
[nltk_data]   Package omw-1.4 is already up-to-date!
True

import pandas as pd
import glob
import re
from functools import reduce
from nltk.tokenize import word_tokenize
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
from collections import Counter
import matplotlib.pyplot as plt
from wordcloud import STOPWORDS, WordCloud

all_files = glob.glob('myCabinetExcelData*.xls')
all_files   #출력하여 내용 확인
['myCabinetExcelData (4).xls',
 'myCabinetExcelData.xls',
 'myCabinetExcelData (2).xls',
 'myCabinetExcelData (3).xls',
 'myCabinetExcelData (8).xls',
 'myCabinetExcelData (6).xls',
 'myCabinetExcelData (9).xls',
 'myCabinetExcelData (7).xls',
 'myCabinetExcelData (5).xls',
 'myCabinetExcelData (1).xls']
 
 all_files_data = []     #저장할 리스트
for file in all_files:
    data_frame = pd.read_excel(file)
    all_files_data.append(data_frame)
all_files_data[0]      #작업 내용 확인

all_files_data_concat = pd.concat(all_files_data, axis = 0, ignore_index = True)
all_files_data_concat   #출력하여 내용 확인

all_files_data_concat.to_csv('riss_bigdata.csv', encoding = 'utf-8', index = False)

데이터 전처리

all_title = all_files_data_concat['제목']
all_title   #출력하여 내용 확인
0      A Big-Data-based platform of workers’ behavior...
1      Using Big-data and Surface Fitting to Improve ...
2      Using Big-data and Surface Fitting to Improve ...
3      A big-data method to predict the targets of sm...
4      Social Big-Data-Based Content Dissemination in...
                             ...                        
995    Optimized Deep Learning for EEG Big Data and S...
996    DiP-SVM : Distribution Preserving Kernel Suppo...
997    Perceptions of independent financial advisors ...
998    Fast approaches for semantic service compositi...
999    Availability modelling and assurance for a big...
Name: 제목, Length: 1000, dtype: object

stopWords = set(stopwords.words("english"))
lemma = WordNetLemmatizer()

words = []

for title in all_title:
    EnWords = re.sub(r"[^a-zA-Z]+", " ", str(title))
    EnWordsToken = word_tokenize(EnWords.lower())
    EnWordsTokenStop = [w for w in EnWordsToken if w not in stopWords]
    EnWordsTokenStopLemma = [lemma.lemmatize(w) for w in EnWordsTokenStop]
    words.append(EnWordsTokenStopLemma)
    print(words)    #출력하여 내용 확인
IOPub data rate exceeded.
The notebook server will temporarily stop sending output
to the client in order to avoid crashing it.
To change this limit, set the config variable
`--NotebookApp.iopub_data_rate_limit`.

Current values:
NotebookApp.iopub_data_rate_limit=1000000.0 (bytes/sec)
NotebookApp.rate_limit_window=3.0 (secs)

words2 = list(reduce(lambda x, y: x+y, words))
print(words2)   #출력하여 내용 확인
['big', 'data', 'based', 'platform', 'worker', 'behavior', 'observation', 'field', 'using', 'big', 'data', 'surface', 'fitting', 'improve', 'aircraft', 'safety', 'study', 'relationship', 'anomaly', 'using', 'big', 'data', 'surface', 'fitting', 'improve', 'aircraft', 'safety', 'study', 'relationship', 'anomaly', 'big', 'data', 'method', 'predict', 'target', 'small', 'molecule', 'accelerate', 'drug', 'discovery', 'social', 'big', 'data', 'based', 'content', 'dissemination', 'internet', 'vehicle', 'big', 'data', 'analysis', 'disaster', 'information', 'dissemination', 'south', 'korea', 'real', 'world', 'big', 'data', 'study', 'laboratory', 'medicine', 'current', 'status', 'application', 'future', 'consideration', 'applying', 'big', 'data', 'technology', 'network', 'architecture', 'ecology', 'big', 'data', 'approach', 'lead', 'increased', 'understanding', 'ecology', 'animal', 'movement', 'persisting', 'big', 'data', 'nosql', 'landscape', 'big', 'data', 'analysis', 'political', 'rhetoric', 'relating', 'development', 'united', 'state', 'china', 'global', 'power', 'big', 'data', 'approach', 'understanding', 'thematic', 'landscape', 'field', 'business', 'ethic', 'big', 'data', 'security', 'mechanism', 'based', 'fully', 'homomorphic', 'encryption', 'using', 'cubic', 'spline', 'curve', 'public', 'key', 'cryptography', 'big', 'data', 'perspective', 'ai', 'newton', 'merton', 'analytics', 'intelligence', 'big', 'data', 'spatial', 'temporal', 'network', 'analysis', 'bovine', 'tuberculosis', 'wildlife', 'badger', 'cattle', 'big', 'data', 'company', 'enter', 'classroom', 'pioneer', 'issue', 'warning', 'big', 'data', 'model', 'multi', 'modal', 'public', 'transportation', 'application', 'macroscopic', 'control', 'optimisation', 'big', 'data', 'oriented', 'recommendation', 'method', 'based', 'multi', 'objective', 'optimization', 'sears', 'big', 'data', 'strategy', 'service', 'call', 'away', 'big', 'data', 'job', 'outlook', 'encouraging', 'gm', 'plan', 'hire', 'hp', 'employee', 'dynamic', 'big', 'data', 'broadcast', 'fat', 'tree', 'data', 'center', 'network', 'mobile', 'iot', 'device', 'becoming', 'data', 'savvy', 'big', 'data', 'world', 'bifm', 'big', 'data', 'driven', 'intelligent', 'forecasting', 'model', 'covid', 'distributed', 'big', 'data', 'optimization', 'via', 'blockwise', 'gradient', 'tracking', 'big', 'data', 'based', 'platform', 'worker', 'behavior', 'observation', 'field', 'industrial', 'big', 'data', 'driven', 'cps', 'based', 'adaptive', 'production', 'scheduling', 'smart', 'manufacturing', 'big', 'data', 'driven', 'intelligent', 'wireless', 'network', 'architecture', 'use', 'case', 'solution', 'future', 'trend', 'streetlytics', 'big', 'data', 'real', 'estate', 'market', 'analysis', 'mobile', 'big', 'data', 'driven', 'rating', 'framework', 'measuring', 'relationship', 'human', 'mobility', 'app', 'usage', 'behavior', 'big', 'data', 'driven', 'approach', 'analyzing', 'modeling', 'human', 'mobility', 'trend', 'non', 'pharmaceutical', 'intervention', 'covid', 'pandemic', 'recent', 'big', 'data', 'struggle', 'birthing', 'pain', 'researcher', 'say', 'selbstlernende', 'big', 'data', 'analysen', 'f', 'r', 'intelligentes', 'energiemanagement', 'big', 'data', 'analytics', 'method', 'capturing', 'visitor', 'activity', 'flow', 'case', 'island', 'country', 'research', 'big', 'data', 'mining', 'visualization', 'application', 'tensor', 'based', 'big', 'data', 'driven', 'routing', 'recommendation', 'approach', 'heterogeneous', 'network', 'comprehensive', 'big', 'data', 'based', 'monitoring', 'system', 'yield', 'enhancement', 'semiconductor', 'manufacturing', 'cloud', 'big', 'data', 'driven', 'dynamic', 'control', 'approach', 'unmanned', 'ground', 'vehicle', 'safety', 'improving', 'optimizing', 'read', 'data', 'flow', 'big', 'data', 'application', 'editorial', 'advanced', 'big', 'data', 'analysis', 'method', 'tool', 'high', 'throughput', 'omics', 'technology', 'traditional', 'v', 'big', 'data', 'fashion', 'trend', 'forecasting', 'examination', 'using', 'wgsn', 'edited', 'distributed', 'partitioned', 'big', 'data', 'optimization', 'via', 'asynchronous', 'dual', 'decomposition', 'lesson', 'big', 'data', 'project', 'four', 'health', 'data', 'network', 'illustrate', 'potential', 'shared', 'national', 'multipurpose', 'big', 'data', 'network', 'ibm', 'stellt', 'big', 'data', 'service', 'au', 'twitter', 'vor', 'leveraging', 'big', 'data', 'revolution', 'cm', 'expanding', 'capability', 'spur', 'health', 'system', 'transformation', 'st', 'century', 'investing', 'big', 'data', 'company', 'soaring', 'despite', 'high', 'price', 'still', 'possible', 'cash', 'advancing', 'manufacturing', 'system', 'big', 'data', 'analytics', 'conceptual', 'framework', 'industrial', 'symbiosis', 'exploring', 'big', 'data', 'approach', 'waste', 'stream', 'discovery', 'ari', 'caroline', 'big', 'data', 'guru', 'mine', 'analytics', 'help', 'cancer', 'patient', 'resource', 'career', 'methodology', 'real', 'time', 'data', 'sustainability', 'smart', 'city', 'towards', 'inferencing', 'analytics', 'big', 'data', 'conex', 'efficient', 'exploration', 'big', 'data', 'system', 'configuration', 'better', 'performance', 'daten', 'und', 'diabetes', 'big', 'data', 'medizin', 'mobile', 'edge', 'computing', 'big', 'data', 'enabled', 'electric', 'vehicle', 'charging', 'accountability', 'big', 'data', 'algorithm', 'enhancing', 'physical', 'layer', 'security', 'via', 'big', 'data', 'aided', 'hybrid', 'relay', 'selection', 'political', 'depression', 'big', 'data', 'multimethod', 'investigation', 'american', 'emotional', 'response', 'trump', 'presidency', 'framework', 'aligning', 'big', 'data', 'strategy', 'organizational', 'goal', 'semantic', 'interoperability', 'big', 'data', 'heterogeneous', 'iot', 'infrastructure', 'healthcare', 'middleware', 'managing', 'big', 'data', 'flow', 'pipelined', 'multi', 'gpu', 'mapreduce', 'big', 'data', 'processing', 'performance', 'evaluation', 'nosql', 'big', 'data', 'application', 'using', 'multi', 'formalism', 'model', 'pmu', 'application', 'real', 'world', 'data', 'medical', 'big', 'data', 'platform', 'real', 'world', 'evidence', 'generation', 'practice', 'post', 'marketing', 'research', 'china', 'jbhi', 'special', 'issue', 'big', 'data', 'health', 'situating', 'ecology', 'big', 'data', 'science', 'current', 'advance', 'challenge', 'solution', 'integrative', 'biology', 'big', 'data', 'centrism', 'mapping', 'bioscience', 'ethic', 'perspective', 'w', 'matrix', 'comparative', 'evaluation', 'big', 'data', 'system', 'scientific', 'image', 'analytics', 'workload', 'facebook', 'fiasco', 'big', 'data', 'researcher', 'rethink', 'ethic', 'abnormal', 'operation', 'tracking', 'big', 'data', 'based', 'gram', 'schmidt', 'orthogonalization', 'production', 'n', 'propyl', 'propionate', 'simulated', 'moving', 'bed', 'reactor', 'case', 'study', 'israeli', 'firm', 'medaware', 'offer', 'big', 'data', 'software', 'help', 'detect', 'prescription', 'error', 'career', 'graduate', 'program', 'big', 'data', 'analyst', 'computing', 'cloud', 'cover', 'big', 'data', 'era', 'putting', 'pressure', 'storage', 'method', 'analysis', 'big', 'data', 'science', 'backwards', 'high', 'performance', 'solution', 'big', 'data', 'gwas', 'smallholder', 'need', 'access', 'big', 'data', 'agronomy', 'ultralow', 'power', 'memory', 'based', 'big', 'data', 'computing', 'platform', 'nonvolatile', 'domain', 'wall', 'nanowire', 'device', 'design', 'implementation', 'big', 'data', 'analysis', 'application', 'spark', 'distribution', 'network', 'based', 'data', 'interception', 'seeking', 'alpha', 'dividend', 'announcement', 'big', 'data', 'insight', 'joining', 'car', 'eva', 'style', 'analysis', 'interview', 'mit', 'stefan', 'wrobel', 'zum', 'thema', 'angewandte', 'big', 'data', 'forschung', 'stochastic', 'approximation', 'statistical', 'origin', 'big', 'data', 'multidisciplinary', 'application', 'code', 'offloading', 'scheme', 'big', 'data', 'processing', 'android', 'application', 'fast', 'algorithm', 'relaxation', 'process', 'big', 'data', 'system', 'page', 'phylogenomics', 'principle', 'opportunity', 'pitfall', 'big', 'data', 'phylogenetics', 'deadline', 'aware', 'scheduling', 'flexible', 'bandwidth', 'allocation', 'big', 'data', 'transfer', 'ad', 'exec', 'future', 'big', 'data', 'draper', 'physical', 'education', 'grand', 'convergence', 'fitnessgram', 'big', 'data', 'digital', 'commerce', 'child', 'health', 'vergleich', 'von', 'kompetenzanforderungen', 'business', 'intelligence', 'und', 'big', 'data', 'spezialisten', 'starting', 'think', 'big', 'data', 'approach', 'clinical', 'detection', 'early', 'stage', 'bipolar', 'disorder', 'evidence', 'based', 'assessment', 'statistical', 'learning', 'model', 'pwr', 'pin', 'homogenized', 'cross', 'section', 'analysis', 'using', 'big', 'data', 'technology', 'jbhi', 'special', 'issue', 'x', 'c', 'big', 'data', 'health', 'x', 'qualitative', 'readiness', 'requirement', 'assessment', 'model', 'enterprise', 'big', 'data', 'infrastructure', 'investment', 'economics', 'econophysics', 'era', 'big', 'data', 'fine', 'grained', 'dynamic', 'resource', 'allocation', 'big', 'data', 'application', 'predicting', 'pattern', 'technology', 'convergence', 'using', 'big', 'data', 'technology', 'large', 'scale', 'triadic', 'patent', 'china', 'kidney', 'disease', 'network', 'ck', 'net', 'big', 'data', 'big', 'dream', 'symposium', 'insure', 'tech', 'digitalization', 'big', 'data', 'technique', 'risk', 'management', 'insurance', 'editorial', 'note', 'hci', 'system', 'big', 'data', 'based', 'multimedia', 'application', 'randomized', 'block', 'proximal', 'method', 'distributed', 'stochastic', 'big', 'data', 'optimization', 'kernel', 'based', 'framework', 'medical', 'big', 'data', 'analytics', 'sieben', 'schritten', 'zum', 'erfolgreichen', 'big', 'data', 'projekt', 'power', 'message', 'network', 'big', 'data', 'analysis', 'network', 'agenda', 'setting', 'model', 'issue', 'ownership', 'architecting', 'time', 'critical', 'big', 'data', 'system', 'computation', 'migration', 'new', 'approach', 'execute', 'big', 'data', 'bioinformatics', 'workflow', 'trust', 'based', 'crowdfunding', 'campaign', 'marketing', 'framework', 'theoretical', 'underpinnings', 'big', 'data', 'analytics', 'practice', 'correction', 'predictive', 'modeling', 'big', 'data', 'bigger', 'really', 'better', 'junqu', 'de', 'fortuny', 'e', 'marten', 'provost', 'f', 'big', 'data', 'scientific', 'training', 'era', 'big', 'data', 'new', 'pedagogy', 'graduate', 'education', 'capturing', 'value', 'biosurveillance', 'big', 'data', 'natural', 'capital', 'accounting', 'diversity', 'big', 'data', 'review', 'scalable', 'data', 'quality', 'big', 'data', 'pythia', 'framework', 'handling', 'missing', 'value', 'structural', 'consequence', 'big', 'data', 'driven', 'education', 'bring', 'big', 'data', 'beyond', 'hype', 'role', 'teamwork', 'analysis', 'big', 'data', 'study', 'visual', 'analytics', 'box', 'office', 'prediction', 'big', 'scholarly', 'data', 'survey', 'big', 'boe', 'fusing', 'spanish', 'official', 'gazette', 'big', 'data', 'technology', 'sport', 'analytics', 'era', 'big', 'data', 'moving', 'toward', 'next', 'frontier', 'global', 'water', 'cycle', 'remote', 'sensing', 'big', 'data', 'overview', 'challenge', 'opportunity', 'big', 'data', 'artificial', 'intelligence', 'fusion', 'technology', 'education', 'context', 'new', 'crown', 'epidemic', 'big', 'data', 'new', 'method', 'idea', 'geological', 'scientific', 'research', 'sport', 'analytics', 'era', 'big', 'data', 'moving', 'toward', 'next', 'frontier', 'data', 'science', 'relationship', 'big', 'data', 'data', 'driven', 'decision', 'making', 'author', 'response', 'gong', 'comment', 'data', 'science', 'relationship', 'big', 'data', 'data', 'driven', 'decision', 'making', 'computational', 'propaganda', 'political', 'big', 'data', 'moving', 'toward', 'critical', 'research', 'agenda', 'call', 'special', 'issue', 'paper', 'big', 'data', 'driven', 'futuristic', 'fabric', 'system', 'societal', 'digital', 'transformation', 'call', 'special', 'issue', 'paper', 'big', 'data', 'driven', 'futuristic', 'fabric', 'system', 'societal', 'digital', 'transformation', 'multiple', 'target', 'tracking', 'big', 'data', 'based', 'measurement', 'extended', 'binary', 'phase', 'shift', 'keying', 'transceiver', 'big', 'data', 'small', 'persona', 'algorithm', 'shape', 'demographic', 'representation', 'data', 'driven', 'user', 'segment', 'call', 'special', 'issue', 'paper', 'big', 'data', 'driven', 'futuristic', 'fabric', 'system', 'societal', 'digital', 'transformation', 'mapping', 'essential', 'urban', 'land', 'use', 'category', 'euluc', 'using', 'geospatial', 'big', 'data', 'progress', 'challenge', 'opportunity', 'big', 'data', 'based', 'rf', 'localisation', 'method', 'unmanned', 'search', 'rescue', 'guest', 'editorial', 'big', 'medium', 'data', 'understanding', 'search', 'mining', 'guest', 'editorial', 'big', 'medium', 'data', 'understanding', 'search', 'mining', 'guest', 'editorial', 'big', 'medium', 'data', 'understanding', 'search', 'mining', 'part', 'special', 'issue', 'biomedical', 'big', 'data', 'understanding', 'learning', 'application', 'new', 'approach', 'accurate', 'distributed', 'cluster', 'analysis', 'big', 'data', 'competitive', 'k', 'mean', 'new', 'approach', 'accurate', 'distributed', 'cluster', 'analysis', 'big', 'data', 'competitive', 'k', 'mean', 'privacy', 'model', 'big', 'data', 'survey', 'secdedoop', 'secure', 'deduplication', 'access', 'control', 'big', 'data', 'hdfs', 'hadoop', 'environment', 'call', 'special', 'issue', 'paper', 'cloud', 'computing', 'big', 'data', 'cognitive', 'iot', 'spatial', 'pattern', 'urban', 'green', 'space', 'actual', 'utilization', 'status', 'china', 'based', 'big', 'data', 'analysis', 'call', 'special', 'issue', 'paper', 'deep', 'learning', 'assisted', 'big', 'data', 'analytics', 'biomedical', 'application', 'digital', 'healthcare', 'earth', 'observation', 'geospatial', 'big', 'data', 'management', 'engagement', 'stakeholder', 'hungary', 'support', 'sdgs', 'atmospheric', 'ecosystem', 'big', 'data', 'providing', 'key', 'contribution', 'reaching', 'united', 'nation', 'sustainable', 'development', 'goal', 'strength', 'number', 'using', 'big', 'data', 'simplify', 'sentiment', 'classification', 'big', 'data', 'lost', 'cloud', 'artificial', 'bee', 'colony', 'reinforced', 'extended', 'kalman', 'filter', 'localization', 'algorithm', 'internet', 'thing', 'big', 'data', 'blending', 'technique', 'finding', 'accurate', 'position', 'reference', 'node', 'labor', 'economist', 'get', 'microscope', 'big', 'data', 'labor', 'market', 'analysis', 'call', 'paper', 'special', 'issue', 'computational', 'propaganda', 'political', 'big', 'data', 'agricultural', 'disaster', 'risk', 'management', 'capability', 'assessment', 'using', 'big', 'data', 'analytics', 'national', 'survey', 'population', 'health', 'big', 'data', 'analytics', 'mobile', 'health', 'monitor', 'big', 'data', 'support', 'sustainable', 'development', 'goal', 'celebration', 'establishment', 'international', 'research', 'center', 'big', 'data', 'sustainable', 'development', 'goal', 'cbas', 'message', 'editor', 'chief', 'big', 'data', 'using', 'big', 'data', 'understand', 'human', 'condition', 'kavli', 'human', 'project', 'big', 'data', 'lost', 'cloud', 'big', 'data', 'healthcare', 'opportunity', 'challenge', 'data', 'science', 'oceanography', 'small', 'data', 'big', 'data', 'call', 'paper', 'big', 'data', 'big', 'data', 'support', 'sustainable', 'development', 'goal', 'continued', 'celebration', 'establishment', 'international', 'research', 'center', 'big', 'data', 'sustainable', 'development', 'goal', 'cbas', 'call', 'special', 'issue', 'paper', 'big', 'scientific', 'data', 'machine', 'learning', 'science', 'engineering', 'rst', 'de', 'rough', 'set', 'based', 'new', 'differential', 'evolution', 'algorithm', 'scalable', 'big', 'data', 'feature', 'selection', 'distributed', 'computing', 'platform', 'survey', 'biological', 'data', 'big', 'data', 'perspective', 'call', 'special', 'issue', 'paper', 'big', 'data', 'analytics', 'intelligent', 'system', 'cybersecurity', 'data', 'behaviour', 'model', 'big', 'data', 'visual', 'analytics', 'call', 'special', 'issue', 'paper', 'big', 'scientific', 'data', 'machine', 'learning', 'science', 'engineering', 'big', 'data', 'analysis', 'swimming', 'pool', 'impact', 'household', 'electric', 'intensity', 'san', 'antonio', 'texas', 'exploiting', 'reused', 'based', 'sharing', 'work', 'opportunity', 'big', 'data', 'multiquery', 'optimization', 'flink', 'big', 'data', 'robotics', 'big', 'data', 'literature', 'search', 'big', 'data', 'machine', 'analyze', 'stock', 'market', 'sentiment', 'call', 'special', 'issue', 'paper', 'big', 'scientific', 'data', 'machine', 'learning', 'science', 'engineering', 'research', 'dilemma', 'behavioral', 'big', 'data', 'challenge', 'big', 'data', 'earth', 'efficient', 'ensemble', 'based', 'classification', 'framework', 'big', 'medical', 'data', 'call', 'special', 'issue', 'paper', 'big', 'data', 'internet', 'thing', 'complex', 'information', 'system', 'selection', 'iotbds', 'complexis', 'call', 'special', 'issue', 'paper', 'internet', 'medical', 'thing', 'big', 'data', 'pervasive', 'medical', 'care', 'call', 'special', 'issue', 'paper', 'big', 'data', 'analytics', 'agricultural', 'disaster', 'management', 'call', 'special', 'issue', 'paper', 'programming', 'model', 'algorithm', 'big', 'data', 'mr', 'dp', 'improving', 'performance', 'resource', 'utilization', 'big', 'data', 'application', 'deadline', 'priority', 'u', 'army', 'person', 'event', 'data', 'environment', 'military', 'civilian', 'big', 'data', 'enterprise', 'monkeyking', 'adaptive', 'parameter', 'tuning', 'big', 'data', 'platform', 'deep', 'reinforcement', 'learning', 'call', 'special', 'issue', 'paper', 'multimedia', 'big', 'data', 'analytics', 'engineering', 'education', 'call', 'special', 'issue', 'paper', 'soft', 'computing', 'model', 'big', 'data', 'internet', 'thing', 'moth', 'flame', 'optimization', 'bat', 'optimization', 'map', 'reduce', 'framework', 'big', 'data', 'clustering', 'using', 'moth', 'flame', 'bat', 'optimization', 'sparse', 'fuzzy', 'c', 'mean', 'call', 'special', 'issue', 'paper', 'soft', 'computing', 'model', 'big', 'data', 'internet', 'thing', 'call', 'special', 'issue', 'paper', 'multimedia', 'big', 'data', 'analytics', 'engineering', 'education', 'big', 'data', 'business', 'bibliometric', 'analysis', 'relevant', 'literature', 'remotely', 'sensed', 'big', 'data', 'ocean', 'polar', 'region', 'big', 'data', 'based', 'security', 'analytics', 'protecting', 'virtualized', 'infrastructure', 'cloud', 'computing', 'unstructured', 'big', 'data', 'analytical', 'method', 'firm', 'conceptual', 'model', 'measurement', 'perception', 'call', 'special', 'issue', 'paper', 'big', 'data', 'analytics', 'agricultural', 'disaster', 'management', 'call', 'special', 'issue', 'paper', 'big', 'data', 'analytics', 'industrial', 'intelligence', 'big', 'data', 'social', 'good', 'call', 'special', 'issue', 'paper', 'cloud', 'computing', 'big', 'data', 'cognitive', 'iot', 'call', 'special', 'issue', 'paper', 'deep', 'learning', 'assisted', 'big', 'data', 'analytics', 'biomedical', 'application', 'digital', 'healthcare', 'call', 'paper', 'special', 'issue', 'computational', 'propaganda', 'political', 'big', 'data', 'optimal', 'feature', 'selection', 'big', 'data', 'classification', 'firefly', 'lion', 'assisted', 'model', 'online', 'analytical', 'processing', 'business', 'intelligence', 'big', 'data', 'big', 'data', 'application', 'guangzhou', 'restaurant', 'analysis', 'call', 'special', 'issue', 'paper', 'programming', 'model', 'algorithm', 'big', 'data', 'medical', 'school', 'training', 'relate', 'practice', 'evidence', 'big', 'data', 'big', 'data', 'little', 'data', 'care', 'coordination', 'medicare', 'beneficiary', 'medigap', 'coverage', 'big', 'data', 'drive', 'development', 'earth', 'science', 'reference', 'architecture', 'big', 'data', 'solution', 'introducing', 'model', 'perform', 'predictive', 'analytics', 'using', 'big', 'data', 'technology', 'big', 'earth', 'data', 'comprehensive', 'analysis', 'visualization', 'analytics', 'issue', 'big', 'earth', 'data', 'new', 'frontier', 'earth', 'information', 'science', 'gsio', 'programmatic', 'interface', 'delivering', 'big', 'earth', 'data', 'service', 'view', 'based', 'model', 'data', 'cube', 'support', 'big', 'earth', 'data', 'system', 'interoperability', 'geospatial', 'semantics', 'ontology', 'knowledge', 'graph', 'big', 'earth', 'data', 'big', 'earth', 'data', 'analytics', 'survey', 'efficient', 'utilization', 'multi', 'core', 'processor', 'many', 'core', 'co', 'processor', 'supercomputer', 'beacon', 'scalable', 'geocomputation', 'geo', 'simulation', 'big', 'earth', 'data', 'exploiting', 'big', 'earth', 'data', 'space', 'first', 'experience', 'timescan', 'processing', 'chain', 'role', 'big', 'earth', 'data', 'understanding', 'climate', 'change', 'big', 'earth', 'data', 'analytics', 'sentinel', 'landsat', 'imagery', 'support', 'global', 'human', 'settlement', 'mapping', 'stewardship', 'analysis', 'big', 'earth', 'observation', 'data', 'analytics', 'big', 'geosocial', 'medium', 'crowdsourced', 'data', 'big', 'spatial', 'vector', 'data', 'management', 'review', 'generalized', 'supervised', 'classification', 'scheme', 'produce', 'provincial', 'wetland', 'inventory', 'map', 'application', 'google', 'earth', 'engine', 'big', 'geo', 'data', 'processing', 'big', 'earth', 'data', 'facilitates', 'sustainable', 'development', 'goal', 'evaluating', 'role', 'partnership', 'increasing', 'use', 'big', 'earth', 'data', 'support', 'sustainable', 'development', 'goal', 'australian', 'perspective', 'semantics', 'way', 'semantic', 'web', 'open', 'science', 'big', 'earth', 'data', 'innovative', 'approach', 'sustainable', 'development', 'goal', 'using', 'big', 'earth', 'data', 'visual', 'analytics', 'big', 'open', 'public', 'transport', 'data', 'framework', 'pipeline', 'monitoring', 'system', 'performance', 'greater', 'sydney', 'big', 'earth', 'data', 'achieving', 'sustainable', 'development', 'goal', 'belt', 'road', 'region', 'impact', 'wechat', 'public', 'platform', 'blood', 'donation', 'behavior', 'big', 'data', 'based', 'study', 'simd', 'parallel', 'mcmc', 'sampling', 'application', 'big', 'data', 'bayesian', 'analytics', 'review', 'sentiment', 'discovery', 'analysis', 'educational', 'big', 'data', 'data', 'analysis', 'big', 'data', 'improving', 'map', 'shuffle', 'phase', 'hadoop', 'map', 'reduce', 'big', 'bigger', 'biggest', 'data', 'repository', 'balloon', 'unprecedented', 'level', 'coping', 'big', 'data', 'big', 'data', 'analytics', 'review', 'data', 'mining', 'model', 'small', 'medium', 'enterprise', 'transportation', 'sector', 'big', 'data', 'analytics', 'review', 'data', 'mining', 'model', 'small', 'medium', 'enterprise', 'transportation', 'sector', 'transforming', 'big', 'data', 'smart', 'data', 'insight', 'use', 'k', 'nearest', 'neighbor', 'algorithm', 'obtain', 'quality', 'data', 'forecasting', 'big', 'data', 'review', 'semantics', 'meet', 'big', 'data', 'formal', 'model', 'practical', 'issue', 'novel', 'paradigm', 'organisation', 'leverage', 'big', 'data', 'maturity', 'model', 'quantile', 'regression', 'big', 'data', 'divide', 'conquer', 'based', 'strategy', 'visualization', 'statistical', 'modeling', 'financial', 'big', 'data', 'double', 'log', 'modeling', 'skew', 'symmetric', 'error', 'distribution', 'big', 'data', 'big', 'result', 'knowledge', 'discovery', 'output', 'large', 'scale', 'analytics', 'big', 'data', 'technology', 'process', 'spatial', 'attribute', 'data', 'designing', 'operating', 'mine', 'engineering', 'system', 'data', 'discretization', 'taxonomy', 'big', 'data', 'challenge', 'big', 'data', 'challenge', 'big', 'science', 'big', 'data', 'always', 'yield', 'big', 'benefit', 'data', 'mining', 'big', 'data', 'big', 'data', 'big', 'analysis', 'perspective', 'geographical', 'condition', 'monitoring', 'spatial', 'data', 'mining', 'perspective', 'big', 'data', 'using', 'data', 'build', 'better', 'em', 'em', 'big', 'data', 'automated', 'data', 'slicing', 'model', 'validation', 'big', 'data', 'ai', 'integration', 'approach', 'theme', 'data', 'mining', 'big', 'data', 'crime', 'analytics', 'detailed', 'review', 'prominent', 'compression', 'method', 'used', 'reducing', 'data', 'volume', 'big', 'data', 'divide', 'recombine', 'r', 'data', 'science', 'project', 'deep', 'analysis', 'big', 'data', 'high', 'computational', 'complexity', 'overview', 'online', 'based', 'platform', 'sharing', 'analyzing', 'electrophysiology', 'data', 'big', 'data', 'perspective', 'overview', 'online', 'based', 'platform', 'sharing', 'analyzing', 'electrophysiology', 'data', 'big', 'data', 'perspective', 'survey', 'data', 'collection', 'machine', 'learning', 'big', 'data', 'ai', 'integration', 'perspective', 'haery', 'hadoop', 'based', 'query', 'system', 'accumulative', 'high', 'dimensional', 'data', 'model', 'big', 'data', 'gamma', 'matrix', 'summarize', 'dense', 'sparse', 'data', 'set', 'big', 'data', 'analytics', 'big', 'data', 'processing', 'tool', 'experimental', 'performance', 'evaluation', 'applying', 'big', 'data', 'analytics', 'support', 'kansei', 'engineering', 'hotel', 'service', 'development', 'big', 'data', 'analytics', 'security', 'criminal', 'investigation', 'big', 'data', 'analytics', 'machine', 'learning', 'bayesian', 'learning', 'perspective', 'done', 'big', 'data', 'portfolio', 'allocation', 'new', 'approach', 'successful', 'portfolio', 'optimization', 'big', 'data', 'analytics', 'swarm', 'intelligence', 'big', 'data', 'analytics', 'single', 'cell', 'transcriptomics', 'five', 'grand', 'opportunity', 'big', 'data', 'analytics', 'enabled', 'cyber', 'physical', 'system', 'model', 'application', 'dynamic', 'distributed', 'parallel', 'machine', 'learning', 'algorithm', 'big', 'data', 'mining', 'processing', 'big', 'data', 'acquired', 'internet', 'thing', 'enabled', 'industrial', 'multichannel', 'wireless', 'sensor', 'network', 'active', 'monitoring', 'control', 'smart', 'grid', 'industry', 'analysis', 'academic', 'librarian', 'competency', 'skill', 'implementation', 'big', 'data', 'analytics', 'library', 'chicken', 'swarm', 'foraging', 'algorithm', 'big', 'data', 'classification', 'using', 'deep', 'belief', 'network', 'classifier', 'entity', 'deduplication', 'big', 'data', 'graph', 'scholarly', 'communication', 'quality', 'assessment', 'peer', 'produced', 'content', 'knowledge', 'repository', 'using', 'big', 'data', 'social', 'network', 'case', 'implicit', 'collaboration', 'wikipedia', 'big', 'data', 'analytics', 'apache', 'spark', 'big', 'data', 'internet', 'thing', 'revised', 'knowledge', 'pyramid', 'special', 'issue', 'first', 'international', 'conference', 'big', 'data', 'smart', 'computing', 'bigcomp', 'big', 'data', 'analytics', 'security', 'criminal', 'investigation', 'big', 'data', 'fusion', 'big', 'data', 'improves', 'care', 'big', 'data', 'paradigm', 'status', 'privacy', 'security', 'big', 'data', 'cloud', 'computing', 'insight', 'computing', 'environment', 'mapreduce', 'programming', 'framework', 'michael', 'buck', 'talk', 'living', 'dream', 'bioinformaticist', 'emerging', 'alternative', 'big', 'data', 'warehouse', 'practical', 'big', 'data', 'analytics', 'partner', 'allina', 'health', 'team', 'health', 'catalyst', 'garner', 'benefit', 'big', 'data', 'tensor', 'based', 'big', 'data', 'management', 'scheme', 'dimensionality', 'reduction', 'problem', 'smart', 'grid', 'system', 'sdn', 'perspective', 'memory', 'big', 'data', 'management', 'processing', 'survey', 'managing', 'big', 'data', 'coal', 'fired', 'power', 'plant', 'business', 'intelligence', 'framework', 'exploring', 'big', 'data', 'analysis', 'fundamental', 'scientific', 'problem', 'mongodb', 'based', 'modular', 'ontology', 'building', 'big', 'data', 'integration', 'china', 'deal', 'big', 'data', 'flexible', 'heavy', 'tailed', 'distribution', 'big', 'data', 'semantics', 'aware', 'approach', 'big', 'data', 'engineering', 'fuzzy', 'rule', 'based', 'classification', 'system', 'big', 'data', 'mapreduce', 'granularity', 'analysis', 'cell', 'phone', 'big', 'data', 'compute', 'mobility', 'scenario', 'future', 'smart', 'city', 'smart', 'city', 'big', 'data', 'analytics', 'advanced', 'review', 'intelligent', 'approach', 'big', 'data', 'analytics', 'sustainable', 'retail', 'environment', 'using', 'apriori', 'mapreduce', 'framework', 'detection', 'straggler', 'optimal', 'rescheduling', 'slow', 'running', 'task', 'big', 'data', 'environment', 'using', 'lfcso', 'lvq', 'classifier', 'enhanced', 'pso', 'algorithm', 'distributed', 'subdata', 'selection', 'big', 'data', 'via', 'sampling', 'based', 'approach', 'unlocking', 'causal', 'relation', 'barrier', 'big', 'data', 'analytics', 'manufacturing', 'firm', 'efficient', 'skyline', 'computation', 'big', 'data', 'telecom', 'fraud', 'detection', 'big', 'data', 'analytics', 'bayesian', 'consensus', 'clustering', 'lime', 'security', 'big', 'data', 'watchdog', 'approach', 'name', 'matching', 'algorithm', 'big', 'data', 'risk', 'intelligence', 'application', 'hyper', 'convergent', 'platform', 'big', 'data', 'exploring', 'regional', 'innovation', 'system', 'supervised', 'compression', 'big', 'data', 'incremental', 'query', 'processing', 'big', 'data', 'stream', 'analysing', 'repeat', 'blood', 'donation', 'behavior', 'via', 'big', 'data', 'mr', 'plot', 'big', 'data', 'tool', 'distinguishing', 'distribution', 'scalable', 'machine', 'learning', 'algorithm', 'big', 'data', 'analytics', 'comprehensive', 'review', 'modeling', 'evaluating', 'effect', 'big', 'data', 'storage', 'resource', 'allocation', 'global', 'scale', 'cloud', 'architecture', 'online', 'updating', 'method', 'correct', 'measurement', 'error', 'big', 'data', 'stream', 'responding', 'market', 'turbulence', 'big', 'data', 'analytics', 'mass', 'customization', 'capability', 'holistic', 'view', 'big', 'data', 'barrier', 'embedding', 'big', 'data', 'solution', 'smart', 'factory', 'insight', 'sap', 'consultant', 'exploring', 'path', 'big', 'data', 'analytics', 'implementation', 'success', 'banking', 'financial', 'service', 'integrated', 'approach', 'insight', 'application', 'big', 'data', 'analytics', 'healthcare', 'comparing', 'world', 'regional', 'sustainable', 'supply', 'chain', 'finance', 'using', 'big', 'data', 'analytics', 'bibliometric', 'analysis', 'framework', 'evaluating', 'design', 'methodology', 'big', 'data', 'warehouse', 'measurement', 'design', 'process', 'stakeholder', 'perception', 'word', 'mouth', 'csr', 'dynamic', 'big', 'data', 'analysis', 'twitter', 'advancing', 'science', 'technology', 'big', 'data', 'analytics', 'predicting', 'rating', 'amazon', 'product', 'using', 'big', 'data', 'bit', 'oriented', 'sampling', 'aggregation', 'big', 'data', 'web', 'medium', 'stock', 'market', 'survey', 'future', 'direction', 'big', 'data', 'perspective', 'k', 'nearest', 'neighbour', 'join', 'big', 'data', 'mapreduce', 'theoretical', 'experimental', 'analysis', 'distributed', 'bayesian', 'matrix', 'decomposition', 'big', 'data', 'mining', 'clustering', 'bias', 'correction', 'small', 'sample', 'big', 'data', 'feasibility', 'distributed', 'kernel', 'regression', 'big', 'data', 'mapreduce', 'incremental', 'mapreduce', 'mining', 'evolving', 'big', 'data', 'enhanced', 'graph', 'transforming', 'v', 'algorithm', 'non', 'simple', 'graph', 'big', 'data', 'pre', 'processing', 'pcce', 'twitter', 'personality', 'based', 'communicative', 'community', 'extraction', 'system', 'big', 'data', 'multi', 'step', 'nonlinear', 'dimension', 'reduction', 'approach', 'application', 'big', 'data', 'semi', 'automatic', 'design', 'methodology', 'big', 'data', 'warehouse', 'transforming', 'fact', 'dimension', 'va', 'store', 'virtual', 'approximate', 'store', 'approach', 'supporting', 'repetitive', 'big', 'data', 'genome', 'sequence', 'analysis', 'mining', 'suspicious', 'tax', 'evasion', 'group', 'big', 'data', 'survey', 'spark', 'ecosystem', 'big', 'data', 'processing', 'infrastructure', 'machine', 'learning', 'application', 'novel', 'pipeline', 'approach', 'efficient', 'big', 'data', 'broadcasting', 'uncertainty', 'measure', 'fusion', 'rule', 'conflict', 'evidence', 'big', 'data', 'via', 'dempster', 'shafer', 'theory', 'privacy', 'preserving', 'distributed', 'contextual', 'federated', 'online', 'learning', 'framework', 'big', 'data', 'support', 'social', 'recommender', 'system', 'security', 'privacy', 'implication', 'database', 'system', 'big', 'data', 'era', 'survey', 'data', 'driven', 'sequential', 'localization', 'framework', 'big', 'telco', 'data', 'towards', 'big', 'linked', 'data', 'large', 'scale', 'distributed', 'semantic', 'data', 'storage', 'effective', 'efficient', 'distributed', 'management', 'big', 'clinical', 'data', 'framework', 'scalable', 'multi', 'data', 'source', 'based', 'recursive', 'approximation', 'approach', 'fast', 'error', 'recovery', 'big', 'sensing', 'data', 'cloud', 'scalable', 'data', 'chunk', 'similarity', 'based', 'compression', 'approach', 'efficient', 'big', 'sensing', 'data', 'processing', 'cloud', 'extracting', 'kernel', 'dataset', 'big', 'sensory', 'data', 'wireless', 'sensor', 'network', 'skia', 'scalable', 'efficient', 'memory', 'analytics', 'big', 'spatial', 'textual', 'data', 'efficient', 'distributed', 'generalized', 'canonical', 'correlation', 'analysis', 'big', 'multiview', 'data', 'scalable', 'algorithm', 'nearest', 'neighbor', 'join', 'big', 'trajectory', 'data', 'parallel', 'fractional', 'hot', 'deck', 'imputation', 'variance', 'estimation', 'big', 'incomplete', 'data', 'curing', 'vecchia', 'laplace', 'approximation', 'generalized', 'gaussian', 'process', 'big', 'non', 'gaussian', 'spatial', 'data', 'emulated', 'order', 'identification', 'model', 'big', 'time', 'series', 'data', 'incremental', 'factorization', 'big', 'time', 'series', 'data', 'blind', 'factor', 'approximation', 'handling', 'big', 'dimension', 'distrubted', 'data', 'warehouse', 'using', 'dws', 'technique', 'data', 'access', 'big', 'competitive', 'advantage', 'evidence', 'china', 'car', 'hailing', 'platform', 'eu', 'data', 'privacy', 'law', 'face', 'big', 'shake', 'big', 'data', 'analytics', 'course', 'given', 'student', 'expectation', 'industry', 'demand', 'time', 'frame', 'content', 'make', 'analytics', 'course', 'big', 'data', 'friendly', 'big', 'data', 'transformation', 'heterogeneous', 'data', 'semantically', 'enriched', 'simplified', 'data', 'editorial', 'commentary', 'mo', 'data', 'mo', 'problem', 'big', 'data', 'national', 'database', 'study', 'orthopaedic', 'research', 'concern', 'generalizability', 'oracle', 'sparc', 'accelerates', 'big', 'data', 'oracle', 'recent', 'sparc', 'processor', 'integrate', 'hardware', 'acceleration', 'database', 'software', 'big', 'data', 'analytics', 'security', 'also', 'enable', 'compression', 'memory', 'database', 'working', 'field', 'big', 'data', 'using', 'big', 'data', 'augmented', 'online', 'ethnography', 'study', 'candidate', 'candidate', 'interaction', 'election', 'time', 'ready', 'big', 'data', 'governance', 'big', 'data', 'research', 'ready', 'big', 'data', 'governance', 'big', 'data', 'research', 'hazy', 'making', 'easier', 'build', 'maintain', 'big', 'data', 'analytics', 'racing', 'unleash', 'full', 'potential', 'big', 'data', 'latest', 'statistical', 'machine', 'learning', 'technique', 'big', 'data', 'based', 'power', 'battery', 'recycling', 'new', 'energy', 'vehicle', 'information', 'sharing', 'platform', 'intelligent', 'transportation', 'optimization', 'big', 'data', 'clustering', 'interval', 'type', 'fuzzy', 'uncertainty', 'modeling', 'gene', 'expression', 'datasets', 'big', 'data', 'analysis', 'phase', 'formation', 'rule', 'high', 'entropy', 'alloy', 'big', 'data', 'big', 'compute', 'technology', 'realm', 'possibility', 'emergence', 'real', 'time', 'risk', 'enterprise', 'capital', 'market', 'industry', 'big', 'data', 'driven', 'safety', 'decision', 'making', 'conceptual', 'framework', 'influencing', 'factor', 'big', 'data', 'machine', 'learning', 'revamp', 'computational', 'toxicology', 'use', 'risk', 'assessment', 'big', 'data', 'theoretical', 'engineering', 'analytics', 'perspective', 'big', 'data', 'method', 'advance', 'public', 'health', 'informatics', 'smarter', 'public', 'health', 'prevention', 'system', 'sphps', 'big', 'data', 'approach', 'protein', 'structure', 'prediction', 'metagenomics', 'sequence', 'data', 'give', 'protein', 'structure', 'prediction', 'boost', 'big', 'data', 'based', 'real', 'time', 'interactive', 'growth', 'management', 'system', 'wireless', 'communication', 'big', 'data', 'analysis', 'carbon', 'emission', 'reduction', 'car', 'towards', 'walkable', 'green', 'smart', 'community', 'big', 'data', 'analysis', 'cluster', 'analysis', 'machine', 'learning', 'approach', 'big', 'data', 'tensor', 'recovery', 'high', 'dimensional', 'uncertainty', 'quantification', 'process', 'variation', 'big', 'data', 'approach', 'bioinformatics', 'workflow', 'comparative', 'assessment', 'big', 'data', 'application', 'government', 'sector', 'big', 'data', 'streaming', 'application', 'scheduling', 'online', 'learning', 'concept', 'drift', 'detection', 'big', 'data', 'driven', 'stem', 'cell', 'science', 'tissue', 'engineering', 'vision', 'unique', 'opportunity', 'big', 'data', 'cloud', 'service', 'platform', 'growth', 'enterprise', 'adaptive', 'exception', 'handling', 'parallelized', 'data', 'mining', 'big', 'data', 'analytics', 'drive', 'parking', 'policy', 'evaluating', 'adherence', 'meter', 'time', 'limit', 'washington', 'c', 'big', 'data', 'satellite', 'yearly', 'report', 'generation', 'aiaa', 'big', 'data', 'reflection', 'high', 'energy', 'electron', 'diffraction', 'analysis', 'understanding', 'epitaxial', 'film', 'growth', 'process', 'big', 'data', 'analytics', 'critical', 'review', 'future', 'direction', 'data', 'transfer', 'scheduling', 'maximizing', 'throughput', 'big', 'data', 'computing', 'cloud', 'system', 'big', 'data', 'mechanism', 'energy', 'policy', 'design', 'big', 'data', 'challenge', 'advance', 'highlight', 'big', 'issue', 'big', 'data', 'business', 'model', 'critical', 'literature', 'review', 'multiperspective', 'research', 'framework', 'big', 'data', 'science', 'infrastructure', 'impact', 'big', 'data', 'feature', 'screening', 'using', 'bregman', 'divergence', 'big', 'data', 'analytics', 'framework', 'incorporating', 'smallholder', 'sustainable', 'palm', 'oil', 'production', 'big', 'data', 'building', 'energy', 'performance', 'lesson', 'assembling', 'large', 'national', 'database', 'building', 'energy', 'use', 'big', 'data', 'based', 'online', 'state', 'charge', 'estimation', 'energy', 'consumption', 'prediction', 'electric', 'vehicle', 'big', 'data', 'anwendungsf', 'lle', 'methodisch', 'erarbeiten', 'big', 'data', 'technologien', 'der', 'fahrzeugentwicklung', 'big', 'data', 'science', 'infrastructure', 'impact', 'big', 'data', 'visualization', 'big', 'data', 'approach', 'three', 'dimensional', 'building', 'extraction', 'aerial', 'laser', 'scanning', 'big', 'data', 'x', 'ray', 'phase', 'contrast', 'imaging', 'simulation', 'challenge', 'big', 'data', 'question', 'horizontal', 'vertical', 'intelligence', 'discussion', 'disaster', 'management', 'big', 'data', 'based', 'intelligent', 'spectrum', 'sensing', 'heterogeneous', 'spectrum', 'communication', 'g', 'big', 'sensing', 'data', 'curation', 'cloud', 'coming', 'promise', 'scalable', 'cloud', 'data', 'center', 'mitigation', 'next', 'generation', 'iot', 'wireless', 'sensor', 'network', 'big', 'data', 'gesch', 'ftsmodelle', 'die', 'drei', 'seiten', 'der', 'medaille', 'big', 'data', 'based', 'edge', 'biomarkers', 'study', 'dynamical', 'drug', 'sensitivity', 'resistance', 'individual', 'big', 'data', 'analysis', 'human', 'nature', 'relation', 'newspaper', 'coverage', 'big', 'data', 'approach', 'contemporary', 'french', 'politics', 'big', 'data', 'balancing', 'risk', 'opportunity', 'uae', 'perspective', 'big', 'idea', 'big', 'data', 'president', 'message', 'data', 'warehousing', 'olap', 'big', 'data', 'survey', 'state', 'art', 'open', 'problem', 'future', 'challenge', 'big', 'data', 'quick', 'start', 'guide', 'learning', 'practitioner', 'big', 'data', 'next', 'frontier', 'innovation', 'therapeutic', 'healthcare', 'big', 'data', 'challenge', 'social', 'science', 'criticism', 'social', 'knowledge', 'politics', 'big', 'data', 'incoming', 'challenge', 'vehicular', 'ad', 'hoc', 'networking', 'big', 'data', 'connected', 'car', 'shine', 'mobile', 'world', 'congress', 'elsewhere', 'big', 'data', 'revolution', 'transform', 'live', 'work', 'think', 'big', 'data', 'next', 'generation', 'machine', 'big', 'science', 'big', 'data', 'technique', 'technology', 'geoinformatics', 'h', 'karimi', 'editor', 'crc', 'press', 'taylor', 'francis', 'london', 'pp', 'price', 'isbn', 'big', 'data', 'scoring', 'unter', 'dem', 'einfluss', 'der', 'datenschutz', 'grundverordnung', 'big', 'data', 'revolution', 'transform', 'live', 'work', 'think', 'data', 'big', 'small', 'big', 'data', 'enabled', 'large', 'scale', 'group', 'decision', 'making', 'circular', 'economy', 'emerging', 'market', 'context', 'big', 'data', 'challenge', 'big', 'data', 'warum', 'sie', 'noch', 'lange', 'darauf', 'warten', 'sollten', 'big', 'data', 'analytics', 'capability', 'decision', 'making', 'role', 'data', 'driven', 'insight', 'circular', 'economy', 'performance', 'big', 'data', 'opportunity', 'challenge', 'english', 'corpus', 'linguistics', 'big', 'data', 'bad', 'data', 'sensitivity', 'security', 'policy', 'imperfect', 'information', 'big', 'data', 'impact', 'trading', 'technology', 'big', 'data', 'driven', 'contextual', 'processing', 'method', 'electrical', 'capacitance', 'tomography', 'big', 'data', 'guest', 'editorial', 'big', 'data', 'driven', 'cognitive', 'computing', 'system', 'optimization', 'social', 'medium', 'analytics', 'big', 'data', 'approach', 'novel', 'anti', 'cancer', 'drug', 'discovery', 'big', 'data', 'history', 'current', 'status', 'challenge', 'going', 'forward', 'big', 'data', 'goldmine', 'minefield', 'big', 'data', 'impact', 'u', 'medical', 'industry', 'big', 'data', 'im', 'gesundheitsrecht', 'ende', 'der', 'datensparsamkeit', 'big', 'data', 'technique', 'technology', 'geoinformatics', 'edited', 'hassan', 'karimi', 'boca', 'raton', 'fl', 'crc', 'press', 'pp', 'u', 'hardcover', 'isbn', 'big', 'data', 'information', 'security', 'context', 'big', 'data', 'based', 'identification', 'methylated', 'gene', 'associated', 'drug', 'resistance', 'prognosis', 'ovarian', 'cancer', 'big', 'data', 'driven', 'strategic', 'orientation', 'international', 'marketing', 'big', 'data', 'collaboration', 'exploring', 'recording', 'sharing', 'enterprise', 'knowledge', 'big', 'data', 'assisted', 'social', 'medium', 'analytics', 'business', 'model', 'business', 'decision', 'making', 'system', 'competitive', 'analysis', 'big', 'data', 'driven', 'personal', 'protective', 'equipment', 'stockpiling', 'framework', 'universal', 'healthcare', 'system', 'disease', 'control', 'prevention', 'covid', 'era', 'big', 'data', 'pharmaceutical', 'science', 'challenge', 'opportunity', 'big', 'data', 'big', 'knowledge', 'brazilian', 'genomics', 'process', 'academic', 'marketization', 'big', 'data', 'driven', 'business', 'model', 'innovation', 'traditional', 'industry', 'chinese', 'economy', 'big', 'buck', 'backing', 'big', 'data', 'lobbying', 'effort', 'capitol', 'hill', 'big', 'data', 'implication', 'financial', 'manager', 'big', 'data', 'big', 'data', 'new', 'opportunity', 'big', 'data', 'opportunity', 'challenge', 'solution', 'big', 'data', 'driven', 'scheduling', 'optimization', 'algorithm', 'cyber', 'physical', 'system', 'based', 'cloud', 'platform', 'big', 'data', 'beginning', 'future', 'big', 'data', 'pedestrian', 'volume', 'exploring', 'use', 'google', 'street', 'view', 'image', 'pedestrian', 'count', 'big', 'data', 'creating', 'right', 'balance', 'big', 'data', 'gaining', 'competitive', 'edge', 'big', 'data', 'research', 'social', 'policy', 'big', 'data', 'oriented', 'product', 'infant', 'failure', 'intelligent', 'root', 'cause', 'identification', 'using', 'associated', 'tree', 'fuzzy', 'dea', 'big', 'data', 'little', 'data', 'scholarship', 'networked', 'world', 'big', 'data', 'dimension', 'evolution', 'impact', 'challenge', 'big', 'data', 'large', 'prospective', 'randomized', 'trial', 'obsolete', 'future', 'big', 'data', 'need', 'analysis', 'decision', 'process', 'big', 'data', 'disparate', 'impact', 'data', 'driven', 'medicinal', 'chemistry', 'era', 'big', 'data', 'big', 'data', 'promise', 'peril', 'big', 'data', 'crunch', 'time', 'pharma', 'big', 'data', 'improve', 'patient', 'centered', 'care', 'big', 'data', 'big', 'handprint', 'big', 'universe', 'big', 'data', 'machine', 'learning', 'image', 'analysis', 'astronomy', 'big', 'data', 'ignorance', 'longer', 'acceptable', 'big', 'data', 'qualitative', 'approach', 'digital', 'research', 'big', 'data', 'come', 'pediatric', 'anesthesia', 'data', 'driven', 'authoritarianism', 'non', 'democracy', 'big', 'data', 'big', 'data', 'dynamic', 'factor', 'model', 'macroeconomic', 'measurement', 'forecasting', 'discussion', 'paper', 'lucrezia', 'reichlin', 'mark', 'w', 'watson', 'big', 'data', 'enabled', 'intelligent', 'synchronisation', 'complex', 'production', 'logistics', 'system', 'opti', 'state', 'control', 'strategy', 'big', 'data', 'small', 'conclusion', 'data', 'envelopment', 'analysis', 'big', 'data', 'systematic', 'literature', 'review', 'bibliometric', 'analysis', 'big', 'data', 'challenge', 'perspective', 'big', 'data', 'driven', 'detection', 'false', 'data', 'injection', 'attack', 'smart', 'meter', 'big', 'data', 'lesson', 'employer', 'employee', 'big', 'data', 'alexander', 'von', 'humboldt', 'approach', 'science', 'big', 'data', 'based', 'approach', 'detect', 'locate', 'enhance', 'stability', 'unplanned', 'microgrid', 'islanding', 'big', 'data', 'whole', 'yotta', 'byte', 'data', 'enabled', 'digestive', 'medicine', 'new', 'big', 'data', 'analytics', 'platform', 'big', 'data', 'based', 'epidemiology', 'uveitis', 'related', 'intraocular', 'inflammation', 'big', 'data', 'utility', 'seek', 'extract', 'value', 'technology', 'asset', 'smart', 'grid', 'metering', 'data', 'becoming', 'gold', 'mine', 'insight', 'improve', 'service', 'save', 'money', 'big', 'data', 'based', 'prediction', 'terrorist', 'attack', 'big', 'data', 'based', 'driving', 'pattern', 'clustering', 'evaluation', 'combination', 'driving', 'circumstance', 'big', 'data', 'analytics', 'healthcare', 'data', 'driven', 'method', 'typical', 'treatment', 'pattern', 'mining', 'data', 'asset', 'upstream', 'oil', 'gas', 'industry', 'learn', 'big', 'data', 'company', 'like', 'social', 'medium', 'big', 'data', 'management', 'revolution', 'data', 'asset', 'oil', 'gas', 'sector', 'learn', 'industry', 'big', 'data', 'big', 'data', 'new', 'information', 'challenge', 'big', 'data', 'alzheimer', 'disease', 'research', 'environmental', 'scan', 'big', 'data', 'understanding', 'creative', 'organisation', 'create', 'sustain', 'network', 'big', 'data', 'promise', 'messy', 'like', 'u', 'big', 'data', 'come', 'pediatric', 'anesthesia', 'big', 'data', 'based', 'attack', 'scenario', 'reconstruction', 'architecture', 'smart', 'grid', 'big', 'data', 'big', 'problem', 'big', 'data', 'managing', 'unmanageable', 'big', 'data', 'based', 'precise', 'diagnosis', 'space', 'range', 'communication', 'system', 'big', 'data', 'educational', 'administration', 'application', 'predicting', 'school', 'dropout', 'risk', 'big', 'data', 'oriented', 'paas', 'architecture', 'disk', 'resource', 'capability', 'container', 'based', 'virtualization', 'big', 'data', 'element', 'good', 'question', 'open', 'data', 'powerful', 'software', 'data', 'driven', 'innovation', 'switching', 'perspective', 'big', 'data', 'big', 'data', 'small', 'child', 'adult', 'height', 'preterm', 'infant', 'treated', 'gh', 'big', 'data', 'panoptic', 'medium', 'information', 'age', 'donepezil', 'use', 'emergence', 'comorbidity', 'patient', 'alzheimer', 'dementia', 'finding', 'real', 'world', 'big', 'data', 'japan', 'big', 'data', 'small', 'data', 'concept', 'big', 'data', 'hard', 'quantify', 'good', 'remember', 'hat', 'big', 'data', 'made', 'small', 'data', 'big', 'data', 'big', 'deal', 'e', 'amp', 'p', 'industry', 'must', 'work', 'big', 'data', 'meet', 'future', 'demand', 'growing', 'global', 'population', 'big', 'data', 'leveraging', 'big', 'data', 'improve', 'subscriber', 'service', 'management', 'big', 'data', 'politics', 'open', 'data', 'case', 'health', 'care', 'data', 'england', 'big', 'data', 'preparing', 'future', 'smart', 'water', 'big', 'data', 'big', 'data', 'friend', 'foe', 'digital', 'advertising', 'five', 'way', 'marketer', 'use', 'digital', 'big', 'data', 'advantage', 'big', 'data', 'introduction', 'big', 'data', 'technique', 'gain', 'competitive', 'advantage', 'company', 'biggest', 'asset', 'data', 'big', 'data', 'big', 'pain', 'big', 'gain', 'lawyer', 'big', 'data', 'risk', 'management', 'financial', 'service', 'industry', 'provides', 'lesson', 'prevent', 'big', 'data', 'becoming', 'big', 'problem', 'big', 'data', 'big', 'gain', 'big', 'pain', 'lawyer', 'huge', 'quantity', 'data', 'offer', 'potential', 'boon', 'lawyer', 'cause', 'headache', 'big', 'data', 'fourth', 'data', 'management', 'generation', 'big', 'data', 'savvy', 'team', 'skill', 'big', 'data', 'driven', 'action', 'business', 'performance', 'big', 'security', 'big', 'data', 'addressing', 'security', 'challenge', 'big', 'data', 'infrastructure', 'big', 'data', 'big', 'opportunity', 'big', 'challenge', 'big', 'data', 'mining', 'turning', 'data', 'mining', 'predictive', 'analytics', 'large', 'scale', 'v', 'data', 'future', 'challenge', 'knowledge', 'discovery', 'big', 'data', 'small', 'data', 'data', 'sharing', 'long', 'tail', 'neuroscience', 'big', 'data', 'little', 'data', 'data', 'scholarship', 'networked', 'world', 'christine', 'l', 'borgman', 'cambridge', 'mit', 'press', 'pp', 'hardcover', 'isbn', 'big', 'data', 'big', 'data', 'set', 'big', 'analogue', 'data', 'biggest', 'big', 'data', 'big', 'data', 'little', 'data', 'data', 'scholarship', 'networked', 'world', 'christine', 'l', 'borgman', 'big', 'benefit', 'big', 'data', 'real', 'time', 'data', 'product', 'creation', 'distribution', 'system', 'big', 'data', 'bigger', 'question', 'data', 'based', 'business', 'model', 'implication', 'organizational', 'boundary', 'data', 'governance', 'society', 'data', 'protection', 'era', 'big', 'data', 'challenge', 'posed', 'big', 'personal', 'data', 'big', 'data', 'navigating', 'federal', 'aim', 'state', 'privacy', 'law', 'potential', 'pitfall', 'applying', 'big', 'data', 'healthcare', 'legal', 'expert', 'law', 'firm', 'foley', 'amp', 'lardner', 'offer', 'advice', 'complying', 'federal', 'state', 'privacy', 'law', 'big', 'data', 'big', 'data', 'mean', 'implication', 'marketer', 'brand', 'big', 'data', 'harnessing', 'big', 'data', 'company', 'one', 'way', 'improve', 'operation', 'product', 'development', 'big', 'data', 'little', 'data', 'data', 'scholarship', 'networked', 'world', 'big', 'data', 'meet', 'material', 'science', 'training', 'future', 'generation', 'capitalizing', 'promise', 'big', 'data', 'require', 'material', 'scientist', 'trained', 'data', 'informatics', 'several', 'university', 'answering', 'call', 'big', 'data', 'little', 'data', 'data', 'scholarship', 'networked', 'world', 'big', 'data', 'based', 'improved', 'data', 'acquisition', 'storage', 'system', 'designing', 'industrial', 'data', 'platform', 'big', 'data', 'internal', 'audit', 'liz', 'sandwith', 'chartered', 'institute', 'internal', 'auditor', 'look', 'big', 'data', 'affect', 'internal', 'audit', 'process', 'big', 'data', 'promise', 'pay', 'big', 'dividend', 'swd', 'inc', 'continuous', 'improvement', 'minded', 'provider', 'metal', 'finishing', 'fastener', 'sorting', 'service', 'new', 'data', 'discovery', 'analysis', 'software', 'allows', 'swd', 'management', 'unlock', 'new', 'pathway', 'improved', 'performance', 'serf', 'foundation', 'customer', 'communication', 'strategy', 'future', 'big', 'data', 'device', 'reprocessing', 'among', 'fda', 'device', 'science', 'goal', 'fda', 'device', 'center', 'science', 'council', 'want', 'leverage', 'called', 'big', 'data', 'evidence', 'real', 'world', 'clinical', 'experience', 'improve', 'reprocessing', 'reusable', 'medical', 'device', 'enhance', 'use', 'health', 'information', 'technology', 'device', 'cybersecurity', 'among', 'fy', 'regulatory', 'science', 'goal', 'data', 'driven', 'revolution', 'internet', 'thing', 'big', 'data', 'data', 'analytics', 'cloud', 'computing', 'used', 'create', 'platform', 'independent', 'advanced', 'process', 'control', 'solution', 'help', 'industry', 'optimize', 'operation', 'cognitive', 'big', 'data', 'survey', 'review', 'big', 'data', 'research', 'implication', 'really', 'new', 'big', 'data', 'data', 'driven', 'revolution', 'internet', 'thing', 'big', 'data', 'data', 'analytics', 'cloud', 'computing', 'used', 'create', 'platform', 'independent', 'advanced', 'process', 'control', 'solution', 'help', 'industry', 'optimize', 'operation', 'data', 'say', 'u', 'learning', 'vast', 'ocean', 'data', 'truth', 'measured', 'world', 'measured', 'self', 'excerpted', 'human', 'face', 'big', 'data', 'big', 'data', 'big', 'deal', 'new', 'method', 'understanding', 'torrent', 'data', 'promise', 'save', 'life', 'propel', 'scholarship', 'demystifying', 'big', 'data', 'anatomy', 'big', 'data', 'developmental', 'process', 'thinking', 'big', 'transitioning', 'student', 'working', 'small', 'student', 'collected', 'data', 'set', 'toward', 'big', 'data', 'think', 'big', 'big', 'data', 'identifying', 'suitable', 'big', 'data', 'strategy', 'corporate', 'environment', 'imagining', 'big', 'data', 'illustration', 'big', 'data', 'u', 'news', 'article', 'big', 'smart', 'data', 'iterative', 'ensemble', 'filter', 'noise', 'filtering', 'big', 'data', 'classification', 'editorial', 'big', 'data', 'new', 'drug', 'discovery', 'tackling', 'big', 'data', 'virtual', 'screening', 'large', 'compound', 'database', 'demystifying', 'big', 'data', 'anatomy', 'big', 'data', 'developmental', 'process', 'telecommunication', 'policy', 'biobanking', 'big', 'data', 'need', 'developing', 'big', 'data', 'metric', 'little', 'team', 'big', 'data', 'big', 'data', 'provides', 'new', 'opportunity', 'team', 'theory', 'busting', 'big', 'data', 'myth', 'backlash', 'big', 'data', 'led', 'question', 'whether', 'overrated', 'bright', 'north', 'rob', 'hick', 'belief', 'offer', 'opportunity', 'provides', 'pointer', 'make', 'cybersecurity', 'big', 'data', 'era', 'securing', 'big', 'data', 'data', 'driven', 'security', 'publicly', 'available', 'data', 'pediatric', 'mental', 'health', 'leveraging', 'big', 'data', 'answer', 'big', 'question', 'child', 'reining', 'big', 'data', 'semantic', 'database', 'may', 'tool', 'bank', 'need', 'harness', 'big', 'data', 'potential', 'value', 'big', 'data', 'use', 'big', 'data', 'provides', 'industry', 'opportunity', 'improve', 'energy', 'efficiency', 'minimize', 'environmental', 'impact', 'optimize', 'operational', 'performance', 'work', 'industry', 'unlock', 'value', 'towards', 'building', 'data', 'intensive', 'index', 'big', 'data', 'computing', 'case', 'study', 'remote', 'sensing', 'data', 'processing', 'case', 'big', 'data', 'new', 'york', 'city', 'kalvi', 'human', 'project', 'aim', 'use', 'big', 'data', 'resolving', 'big', 'health', 'question', 'christine', 'l', 'borgman', 'big', 'data', 'little', 'data', 'data', 'scholarship', 'networked', 'world', 'convex', 'optimization', 'big', 'data', 'scalable', 'randomized', 'parallel', 'algorithm', 'big', 'data', 'analytics', 'u', 'government', 'bet', 'big', 'data', 'multi', 'agency', 'big', 'data', 'initiative', 'offer', 'array', 'national', 'advantage', 'guest', 'editorial', 'big', 'big', 'data', 'finding', 'space', 'peril', 'big', 'data', 'getting', 'real', 'big', 'data', 'applying', 'critical', 'realism', 'analyse', 'big', 'data', 'hype', 'benefit', 'big', 'data', 'big', 'data', 'challenging', 'acquire', 'use', 'reward', 'boost', 'business', 'performance', 'hybrid', 'optimization', 'big', 'data', 'error', 'detection', 'data', 'repairing', 'big', 'data', 'cleaning', 'using', 'cso', 'gsa', 'sensor', 'collect', 'big', 'data', 'energy', 'efficient', 'big', 'data', 'gathering', 'algorithm', 'wsn', 'political', 'science', 'big', 'data', 'structured', 'data', 'unstructured', 'data', 'use', 'politics', 'big', 'data', 'big', 'data', 'big', 'brother', 'exploratory', 'v', 'solution', 'seeking', 'big', 'data', 'discussion', 'experience', 'big', 'data', 'account', 'data', 'scientist', 'perspective', 'politics', 'big', 'data', 'big', 'data', 'big', 'brother', 'r', 'tnani', 'schneidern', 'green', 'ed', 'londonroutledge', 'pp', 'isbn', 'organizational', 'performance', 'capability', 'analyze', 'big', 'data', 'ambidexterity', 'business', 'value', 'big', 'data', 'analytics', 'matter', 'control', 'use', 'ownership', 'big', 'data', 'reciprocal', 'view', 'customer', 'big', 'data', 'value', 'hospitality', 'tourism', 'industry', 'school', 'cio', 'afraid', 'big', 'data', 'big', 'data', 'overwhelming', 'also', 'furnish', 'important', 'information', 'individual', 'student', 'school', 'district', 'anamorphic', 'stretch', 'transform', 'putting', 'squeeze', 'big', 'data', 'coping', 'deluge', 'digital', 'information', 'require', 'efficient', 'way', 'capture', 'sample', 'store', 'data', 'one', 'new', 'approach', 'work', 'selectively', 'warping', 'data', 'provide', 'beiter', 'resolution', 'fine', 'detail', 'still', 'reducing', 'total', 'data', 'size', 'oncology', 'reimbursement', 'era', 'personalized', 'medicine', 'big', 'data', 'little', 'written', 'reimbursement', 'era', 'big', 'data', 'driven', 'personalized', 'medicine', 'way', 'united', 'state', 'pay', 'majority', 'health', 'care', 'hindrance', 'provision', 'personalized', 'oncology', 'care', 'analysis', 'utility', 'risk', 'masked', 'data', 'big', 'data', 'small', 'data', 'analysis', 'radiotherapy', 'rt', 'pattern', 'practice', 'variability', 'identified', 'challenge', 'real', 'world', 'big', 'data', 'recommendation', 'learning', 'analysis', 'multicenter', 'big', 'data', 'aggregation', 'lambda', 'consortium', 'science', 'vocation', 'era', 'big', 'data', 'philosophy', 'science', 'behind', 'big', 'data', 'humanity', 'continued', 'part', 'science', 'aadhaar', 'effect', 'world', 'largest', 'identity', 'project', 'matter', 'dissent', 'aadhaar', 'big', 'data', 'meet', 'big', 'brother', 'data', 'algorithm', 'making', 'digital', 'self', 'alarmingly', 'simplified', 'data', 'gathering', 'head', 'neck', 'radiotherapy', 'rt', 'pattern', 'practice', 'variability', 'identified', 'challenge', 'real', 'world', 'big', 'data', 'recommendation', 'learning', 'analysis', 'multicentre', 'big', 'data', 'aggregation', 'lambda', 'consortium', 'data', 'driven', 'icu', 'management', 'using', 'big', 'data', 'algorithm', 'improve', 'outcome', 'big', 'data', 'prospect', 'challenge', 'big', 'data', 'role', 'expanding', 'access', 'financial', 'service', 'china', 'big', 'data', 'critical', 'question', 'sport', 'society', 'big', 'data', 'based', 'optimization', 'pressure', 'swing', 'adsorption', 'unit', 'syngas', 'purification', 'mapping', 'uncertainty', 'metaheuristic', 'technique', 'big', 'data', 'statistical', 'issue', 'big', 'data', 'theoretical', 'aspect', 'big', 'data', 'model', 'integration', 'ai', 'vector', 'borne', 'disease', 'prediction', 'big', 'data', 'enabled', 'customer', 'relationship', 'management', 'holistic', 'approach', 'big', 'data', 'based', 'estimation', 'ship', 'safety', 'distance', 'distribution', 'port', 'water', 'big', 'data', 'mistake', 'figure', 'fine', 'war', 'number', 'game', 'plus', 'week', 'cultural', 'pick', 'big', 'tiger', 'big', 'data', 'learning', 'social', 'reaction', 'china', 'anticorruption', 'campaign', 'online', 'feedback', 'big', 'data', 'analyse', 'first', 'series', 'katharine', 'bagshaw', 'fca', 'examines', 'scope', 'use', 'data', 'analytics', 'audit', 'big', 'data', 'conceptual', 'modeling', 'rescue', 'big', 'data', 'health', 'care', 'mean', 'make', 'difference', 'big', 'data', 'big', 'bias', 'drug', 'safety', 'data', 'intelligence', 'context', 'big', 'data', 'survey', 'data', 'envelopment', 'analysis', 'big', 'data', 'revisit', 'faster', 'method', 'data', 'driven', 'mary', 'hammon', 'explains', 'leverage', 'potential', 'big', 'data', 'planning', 'practice', 'big', 'data', 'management', 'security', 'application', 'telemetry', 'data', 'product', 'big', 'data', 'patentometrics', 'r', 'decision', 'making', 'data', 'driven', 'decision', 'making', 'precision', 'agriculture', 'rise', 'big', 'data', 'agricultural', 'system', 'big', 'data', 'big', 'opportunity', 'petroleum', 'petrochemical', 'industry', 'data', 'driven', 'artificial', 'intelligence', 'calibration', 'hyperspectral', 'big', 'data', 'big', 'data', 'internet', 'thing', 'internet', 'sign', 'big', 'data', 'driven', 'fuzzy', 'large', 'scale', 'group', 'decision', 'making', 'lsgdm', 'circular', 'economy', 'environment', 'big', 'data', 'driven', 'manufacturing', 'process', 'monitoring', 'quality', 'philosophy', 'big', 'data', 'statistical', 'process', 'control', 'help', 'big', 'data', 'mining', 'meaning', 'data', 'mining', 'bringing', 'order', 'chaos', 'big', 'data', 'expert', 'discus', 'customer', 'data', 'treatment', 'big', 'data', 'boon', 'business', 'intelligence', 'big', 'data', 'big', 'opportunity', 'librarian', 'big', 'data', 'driven', 'personal', 'protective', 'equipment', 'stockpiling', 'framework', 'universal', 'healthcare', 'disease', 'control', 'prevention', 'covid', 'era', 'big', 'data', 'based', 'deterioration', 'prediction', 'model', 'infrastructure', 'management', 'towards', 'assetmetrics', 'big', 'data', 'mental', 'health', 'research', 'current', 'status', 'emerging', 'possibility', 'big', 'data', 'driven', 'precision', 'medicine', 'starting', 'custom', 'made', 'era', 'iatrology', 'data', 'analysis', 'health', 'big', 'data', 'machine', 'learning', 'medical', 'diagnosis', 'model', 'based', 'patient', 'complaint', 'big', 'data', 'present', 'future', 'big', 'data', 'good', 'bad', 'ugly', 'big', 'data', 'shrinking', 'pathogen', 'population', 'big', 'data', 'micropolitics', 'entanglement', 'earth', 'becoming', 'big', 'data', 'revolution', 'transform', 'live', 'work', 'think', 'big', 'data', 'end', 'run', 'around', 'procedural', 'privacy', 'protection', 'recognizing', 'inherent', 'limitation', 'consent', 'anonymity', 'big', 'data', 'look', 'best', 'big', 'data', 'data', 'driven', 'intelligent', 'predictive', 'algorithm', 'support', 'creativity', 'industrial', 'engineering', 'big', 'data', 'manage', 'drown', 'exponential', 'growth', 'data', 'challenging', 'ability', 'process', 'effectively', 'exploit', 'contains', 'big', 'data', 'extending', 'business', 'strategy', 'toolbox', 'big', 'data', 'distributed', 'storage', 'processing', 'online', 'learning', 'system', 'big', 'data', 'meet', 'smart', 'beta', 'big', 'data', 'pharmaceutical', 'science', 'challenge', 'opportunity', 'big', 'data', 'new', 'tend', 'sustainable', 'consumption', 'research', 'coping', 'big', 'data', 'growing', 'pain', 'big', 'data', 'study', 'quantify', 'relationship', 'bmi', 'different', 'cancer', 'using', 'data', 'driven', 'safety', 'decision', 'making', 'realize', 'smart', 'safety', 'management', 'era', 'big', 'data', 'theoretical', 'perspective', 'basic', 'question', 'answer', 'big', 'picture', 'big', 'data', 'solving', 'big', 'data', 'distributing', 'computation', 'among', 'smart', 'device', 'wireless', 'big', 'data', 'transforming', 'heterogeneous', 'network', 'smart', 'network', 'utilizing', 'big', 'data', 'modeling', 'evaluating', 'potential', 'juror', 'using', 'big', 'data', 'versus', 'alternative', 'measure', 'aggregate', 'data', 'predict', 'u', 'presidential', 'election', 'online', 'big', 'data', 'driven', 'oil', 'consumption', 'forecasting', 'google', 'trend', 'data', 'driven', 'ax', 'creation', 'model', 'correlation', 'measurement', 'big', 'data', 'analytics', 'traditional', 'data', 'warehousing', 'meet', 'big', 'data', 'mean', 'enterprise', 'analyzing', 'big', 'data', 'social', 'choice', 'measurement', 'wrangling', 'big', 'data', 'b', 'b', 'smaller', 'medium', 'company', 'need', 'join', 'data', 'party', 'modeling', 'big', 'data', 'based', 'system', 'ontological', 'trading', 'bridging', 'data', 'capacity', 'gap', 'big', 'data', 'storage', 'beyond', 'big', 'data', 'surveillance', 'metadata', 'technology', 'enabled', 'intelligence', 'opportunity', 'counter', 'terrorism', 'harnessing', 'big', 'data', 'capability', 'deliver', 'result', 'regulating', 'big', 'data', 'guideline', 'council', 'europe', 'context', 'european', 'data', 'protection', 'framework', 'big', 'data', 'smart', 'data', 'role', 'metabolomics', 'generating', 'actionable', 'knowledge', 'alzheimer', 'disease', 'personalized', 'treatment', 'optimizing', 'big', 'data', 'big', 'data', 'exist', 'medicinal', 'chemistry', 'harnessed', 'big', 'data', 'based', 'worker', 'behavior', 'observation', 'china', 'metro', 'construction', 'teaching', 'big', 'data', 'analysis', 'young', 'immunologist', 'rethink', 'big', 'european', 'roadmap', 'hardware', 'networking', 'optimization', 'big', 'data', 'big', 'data', 'view', 'tumor', 'immunome', 'integrating', 'big', 'data', 'surgical', 'practice', 'bigger', 'data', 'big', 'data', 'twitter', 'brain', 'computer', 'interface', 'big', 'data', 'business', 'strategy', 'interconnection', 'grand', 'challenge', 'knowledge', 'management', 'review', 'future', 'perspective', 'finance', 'big', 'data', 'management', 'analysis', 'application', 'scholarly', 'big', 'data', 'information', 'extraction', 'data', 'mining', 'demystifying', 'big', 'data', 'mandatory', 'evolution', 'forensic', 'accounting', 'brazil', 'iot', 'based', 'big', 'data', 'smart', 'city', 'towards', 'next', 'generation', 'super', 'city', 'planning', 'data', 'driven', 'decision', 'making', 'big', 'data', 'analysis', 'dns', 'log', 'harnessing', 'big', 'data', 'methodological', 'approach', 'linking', 'electronic', 'health', 'record', 'patient', 'reported', 'survey', 'data', 'integrating', 'big', 'data', 'aquatic', 'ecology', 'challenge', 'opportunity', 'prescription', 'data', 'australia', 'useful', 'big', 'data', 'using', 'big', 'data', 'validate', 'claim', 'made', 'pharmaceutical', 'approval', 'process', 'big', 'data', 'myth', 'pitfall', 'thick', 'data', 'opportunism', 'need', 'different', 'ontology', 'market', 'consumption', 'mining', 'big', 'data', 'philologist', 'perspective', 'big', 'big', 'data', 'embracing', 'big', 'data', 'managing', 'big', 'data', 'cloud', 'computing', 'co', 'location', 'center', 'handling', 'big', 'data', 'video', 'stream', 'professor', 'plamen', 'angelov', 'chair', 'intelligent', 'system', 'lancaster', 'university', 'director', 'entelsensys', 'discus', 'patented', 'new', 'technology', 'predicting', 'updating', 'position', 'target', 'video', 'stream', 'big', 'data', 'different', 'viewpoint', 'big', 'data', 'secret', 'weapon', 'big', 'data', 'smart', 'data', 'algorithm', 'cross', 'evaluation', 'novel', 'method', 'large', 'scale', 'survey', 'analysis', 'geospatial', 'big', 'data', 'new', 'paradigm', 'remote', 'sensing', 'application', 'big', 'data', 'driven', 'framework', 'sustainable', 'smart', 'additive', 'manufacturing', 'administrative', 'data', 'surgeon', 'know', 'big', 'data', 'big', 'data', 'driven', 'root', 'cause', 'analysis', 'system', 'application', 'machine', 'learning', 'quality', 'problem', 'solving', 'data', 'driven', 'approach', 'discovery', 'latest', 'research', 'trend', 'higher', 'education', 'business', 'leveraging', 'advanced', 'technology', 'big', 'data', 'novel', 'big', 'data', 'supported', 'dynamic', 'toll', 'charging', 'system', 'impact', 'assessment', 'portugal', 'shadow', 'toll', 'highway', 'thinking', 'big', 'large', 'publisher', 'beginning', 'realize', 'value', 'big', 'data', 'phenomenon', 'big', 'data', 'driven', 'hr', 'practice', 'improve', 'hr', 'service', 'quality', 'innovation', 'competency', 'smes', 'leveraging', 'big', 'data', 'respiratory', 'medicine', 'data', 'science', 'causal', 'inference', 'precision', 'medicine', 'maximum', 'data', 'resolution', 'efficiency', 'fog', 'computing', 'supported', 'spatial', 'big', 'data', 'processing', 'disaster', 'scenario', 'big', 'data', 'approach', 'global', 'freshwater', 'mussel', 'diversity', 'bivalvia', 'unionoida', 'updated', 'checklist', 'genus', 'specie', 'beyond', 'big', 'data', 'data', 'driven', 'framework', 'business', 'analytics', 'context', 'big', 'data', 'real', 'big', 'data', 'know', 'know', 'youth', 'work', 'leveraging', 'big', 'data', 'enhance', 'effectiveness', 'one', 'health', 'era', 'health', 'informatics', 'big', 'data', 'approach', 'hiv', 'epidemiology', 'prevention', 'big', 'data', 'driven', 'dynamic', 'estimation', 'model', 'relief', 'supply', 'demand', 'urban', 'flood', 'disaster', 'book', 'review', 'big', 'data', 'affect', 'opportunity', 'social', 'strategy', 'growth', 'big', 'data', 'driven', 'approach', 'catering', 'modeling', 'using', 'big', 'data', 'optimally', 'model', 'hydrology', 'water', 'quality', 'across', 'expansive', 'region', 'clouding', 'big', 'data', 'information', 'privacy', 'consideration', 'big', 'picture', 'big', 'data', 'visualization', 'big', 'data', 'enabled', 'consolidated', 'framework', 'energy', 'efficient', 'software', 'defined', 'data', 'center', 'iot', 'setup', 'forget', 'big', 'data', 'beware', 'little', 'data', 'integrating', 'big', 'data', 'simulation', 'predictive', 'analytics', 'real', 'time', 'monitoring', 'data', 'warehousing', 'single', 'cloud', 'application', 'exploring', 'big', 'data', 'next', 'big', 'thing', 'innovation', 'big', 'data', 'application', 'interventional', 'radiology', 'big', 'data', 'big', 'deal', 'big', 'data', 'implication', 'health', 'system', 'pharmacy', 'big', 'data', 'based', 'multimedia', 'transcoding', 'method', 'application', 'multimedia', 'data', 'mining', 'based', 'smart', 'transportation', 'telemedicine', 'big', 'data', 'impact', 'privacy', 'librarian', 'information', 'professional', 'big', 'data', 'collection', 'visualization', 'big', 'data', 'next', 'big', 'thing', 'innovation', 'big', 'data', 'tutorial', 'guideline', 'information', 'process', 'fusion', 'analytics', 'algorithm', 'mapreduce', 'big', 'data', 'hot', 'air', 'hot', 'topic', 'big', 'data', 'institutional', 'perspective', 'opportunity', 'challenge', 'big', 'data', 'blueprint', 'america', 'probed', 'citizen', 'life', 'never', 'big', 'data', 'cbc', 'diff', 'big', 'data', 'driven', 'abnormal', 'behavior', 'detection', 'healthcare', 'based', 'association', 'rule', 'big', 'data', 'transforming', 'drug', 'development', 'health', 'policy', 'decision', 'making', 'big', 'data', 'analysis', 'putting', 'data', 'cart', 'modelling', 'horse', 'big', 'data', 'finder', 'keeper', 'loser', 'weeper', 'big', 'data', 'framework', 'research', 'big', 'data', 'short', 'introduction', 'big', 'data', 'meaning', 'big', 'question', 'big', 'data', 'big', 'data', 'state', 'art', 'concept', 'technique', 'technology', 'modeling', 'approach', 'research', 'challenge', 'big', 'data', 'unleashing', 'information', 'big', 'data', 'new', 'opportunity', 'new', 'challenge', 'guest', 'editor', 'introduction', 'big', 'data', 'informed', 'energy', 'efficiency', 'assessment', 'china', 'industry', 'sector', 'based', 'k', 'mean', 'clustering', 'big', 'data', 'web', 'crawling', 'analysing', 'financial', 'news', 'using', 'rapidminer', 'big', 'data', 'web', 'crawling', 'analysing', 'financial', 'news', 'using', 'rapidminer', 'data', 'driven', 'stochastic', 'robust', 'optimization', 'general', 'computational', 'framework', 'algorithm', 'leveraging', 'machine', 'learning', 'optimization', 'uncertainty', 'big', 'data', 'era', 'big', 'data', 'transforming', 'design', 'philosophy', 'future', 'internet', 'big', 'data', 'e', 'amp', 'p', 'real', 'time', 'adaptive', 'analytics', 'data', 'flow', 'architecture', 'big', 'data', 'game', 'changer', 'environmental', 'management', 'big', 'data', 'trend', 'stroke', 'epidemiology', 'united', 'state', 'good', 'data', 'big', 'building', 'big', 'data', 'energy', 'disclosure', 'policy', 'impact', 'energy', 'use', 'time', 'big', 'data', 'go', 'granular', 'boost', 'sale', 'look', 'advance', 'technology', 'enhancing', 'sale', 'marketing', 'effort', 'commercial', 'launderers', 'big', 'data', 'revolution', 'oder', 'datenhybris', 'big', 'data', 'use', 'data', 'science', 'predictive', 'analytics', 'big', 'data', 'revolution', 'transform', 'supply', 'chain', 'design', 'management', 'big', 'data', 'big', 'unintended', 'consequence', 'big', 'data', 'intensive', 'care', 'unit', 'closing', 'data', 'loop', 'big', 'data', 'promise', 'problem', 'big', 'data', 'adding', 'using', 'big', 'data', 'revolution', 'transform', 'live', 'work', 'think', 'viktor', 'mayer', 'schonberger', 'kenneth', 'cukier', 'big', 'data', 'new', 'look', 'old', 'problem', 'big', 'data', 'trend', 'stroke', 'epidemiology', 'united', 'state', 'good', 'data', 'big', 'data', 'information', 'age', 'big', 'data', 'new', 'perspective', 'establishment', 'segment', 'niche', 'marketing', 'big', 'data', 'stay', 'big', 'data', 'breaking', 'new', 'ground', 'airway', 'research', 'big', 'deep', 'smart', 'data', 'imaging', 'guiding', 'material', 'design', 'big', 'data', 'primer', 'big', 'data', 'flood', 'warning', 'big', 'data', 'qualitative', 'approach', 'digital', 'research', 'big', 'data', 'large', 'scale', 'historical', 'infrastructure', 'minnesota', 'population', 'center', 'big', 'data', 'challenge', 'small', 'research', 'group', 'era', 'cancer', 'genomics', 'big', 'data', 'driven', 'machine', 'learning', 'enabled', 'traffic', 'flow', 'prediction', 'big', 'data', 'normal', 'accident', 'waiting', 'happen', 'big', 'data', 'revolution', 'transform', 'live', 'work', 'think', 'big', 'data', 'progress', 'big', 'headache', 'big', 'data', 'hadoop', 'cloud', 'computing', 'genomics', 'big', 'data', 'case', 'study', 'disruption', 'government', 'power', 'big', 'data', 'next', 'challenge', 'statistic', 'big', 'data', 'cheerleader', 'translational', 'perioperative', 'medicine', 'big', 'data', 'incentivized', 'self', 'big', 'data', 'laboratory', 'medicine', 'big', 'data', 'approach', 'trauma', 'outcome', 'prediction', 'autonomous', 'resuscitation', 'shiming', 'yang', 'mary', 'njoku', 'colin', 'f', 'mackenzie', 'discus', 'massive', 'clinical', 'data', 'used', 'learning', 'prediction', 'model', 'level', 'trauma', 'centre', 'support', 'decision', 'making', 'trauma', 'patient', 'big', 'data', 'paradigm', 'shift', 'needed', 'revolutionize', 'musculoskeletal', 'clinical', 'research', 'data', 'privacy', 'big', 'data', 'compliance', 'issue', 'consideration', 'big', 'data', 'driven', 'marketing', 'machine', 'learning', 'outperforms', 'marketer', 'gut', 'feeling', 'big', 'data', 'use', 'big', 'data', 'practical', 'application', 'big', 'data', 'little', 'difference', 'big', 'data', 'practical', 'application', 'scanning', 'issue', 'big', 'data', 'could', 'ever', 'cure', 'alzheimer', 'disease', 'big', 'data', 'start', 'big', 'data', 'issue', 'challenge', 'moving', 'forward', 'big', 'data', 'theoretical', 'aspect', 'scanning', 'issue', 'big', 'data', 'ethical', 'issue', 'big', 'data', 'target', 'customer', 'customer', 'big', 'data', 'introduction', 'librarian', 'big', 'data', 'herausforderungen', 'und', 'potenziale', 'f', 'r', 'deutsche', 'softwareunternehmen', 'big', 'data', 'reporting', 'guideline', 'big', 'data', 'role', 'expanding', 'access', 'financial', 'service', 'china', 'big', 'data', 'challenge', 'opportunity', 'clinical', 'pharmacology', 'big', 'data', 'ist', 'da', 'und', 'bedeutet', 'e', 'f', 'r', 'wissenschaft', 'wirtschaft', 'und', 'gesellschaft', 'big', 'data', 'new', 'trick', 'econometrics', 'big', 'data', 'key', 'energy', 'efficiency', 'smart', 'building', 'big', 'data', 'survey', 'big', 'data', 'driven', 'supply', 'chain', 'performance', 'measurement', 'system', 'review', 'framework', 'implementation', 'big', 'data', 'new', 'science', 'new', 'challenge', 'new', 'dialogical', 'opportunity', 'big', 'data', 'dark', 'side', 'big', 'data', 'sparse', 'data', 'diverse', 'scientific', 'benchmark', 'reveal', 'optimization', 'imperative', 'implicit', 'membrane', 'energy', 'function', 'big', 'data', 'technique', 'technology', 'geoinformatics', 'big', 'data', 'issue', 'international', 'political', 'sociology', 'data', 'practice', 'big', 'data', 'driven', 'service', 'composition', 'using', 'parallel', 'clustered', 'particle', 'swarm', 'optimization', 'mobile', 'environment', 'big', 'data', 'vascular', 'surgery', 'reality', 'missed', 'use', 'caution', 'big', 'data', 'ownership', 'data', 'recent', 'development', 'europe', 'big', 'data', 'small', 'data', 'methodological', 'review', 'sustainable', 'tourism', 'big', 'data', 'new', 'opportunity', 'transport', 'geography', 'big', 'data', 'panacea', 'big', 'data', 'small', 'data', 'methodological', 'review', 'sustainable', 'tourism', 'data', 'intensive', 'application', 'challenge', 'technique', 'technology', 'survey', 'big', 'data', 'memory', 'tsvd', 'big', 'data', 'factorization', 'government', 'responsiveness', 'public', 'acceptance', 'big', 'data', 'technology', 'urban', 'governance', 'evidence', 'china', 'covid', 'pandemic', 'globeland', 'operational', 'global', 'land', 'cover', 'mapping', 'big', 'data', 'analysis', 'emergency', 'drug', 'procurement', 'planning', 'based', 'big', 'data', 'driven', 'morbidity', 'prediction', 'e', 'commerce', 'logistics', 'distribution', 'mode', 'big', 'data', 'context', 'case', 'analysis', 'jd', 'com', 'wichtig', 'ist', 'auch', 'der', 'hinweis', 'das', 'big', 'data', 'und', 'vergleichbare', 'konzepte', 'nicht', 'unmittelbar', 'greifen', 'study', 'monitoring', 'model', 'big', 'data', 'traffic', 'analysis', 'utilization', 'structural', 'health', 'monitoring', 'big', 'data', 'problem', 'position', 'synchronization', 'track', 'geometry', 'inspection', 'data', 'via', 'big', 'data', 'fusion', 'incremental', 'learning', 'high', 'performance', 'interconnection', 'network', 'exascale', 'big', 'data', 'era', 'rw', 'application', 'artificial', 'intelligence', 'medical', 'big', 'data', 'real', 'world', 'evidence', 'generation', 'literature', 'review', 'research', 'practice', 'china', 'biology', 'must', 'develop', 'big', 'data', 'system', 'kernel', 'based', 'framework', 'medical', 'big', 'data', 'analytics', 'identifying', 'industrial', 'cluster', 'novel', 'big', 'data', 'methodology', 'sic', 'code', 'fit', 'purpose', 'internet', 'age', 'mapping', 'collective', 'behavior', 'big', 'data', 'era', 'monsanto', 'throw', 'hat', 'big', 'data', 'ring', 'failure', 'analysis', 'prediction', 'big', 'data', 'system', 'interview', 'mit', 'michael', 'feindt', 'zum', 'thema', 'pr', 'skriptive', 'big', 'data', 'analyse', 'evolutionary', 'scheduling', 'dynamic', 'multitasking', 'workload', 'big', 'data', 'analytics', 'elastic', 'cloud', 'linear', 'estimation', 'problem', 'information', 'big', 'data', 'system', 'cost', 'optimization', 'deadline', 'aware', 'scheduling', 'big', 'data', 'processing', 'job', 'cloud', 'parallel', 'distributed', 'successive', 'convex', 'approximation', 'method', 'big', 'data', 'optimization', 'best', 'way', 'get', 'business', 'value', 'big', 'data', 'analysis', 'deep', 'thinker', 'three', 'top', 'tier', 'business', 'reaping', 'big', 'reward', 'big', 'data', 'analytics', 'key', 'success', 'include', 'deeply', 'rooted', 'culture', 'analytics', 'relentless', 'focus', 'cost', 'efficiency', 'process', 'improvement', 'convergent', 'evolution', 'army', 'ant', 'syndrome', 'congruence', 'big', 'data', 'phylogenetics', 'analysis', 'common', 'team', 'approach', 'discovery', 'big', 'data', 'environment', 'genetic', 'epidemiology', 'risk', 'assessment', 'secondary', 'primary', 'malignancy', 'nasopharyngeal', 'carcinoma', 'big', 'data', 'intelligence', 'platform', 'based', 'analysis', 'long', 'term', 'survivor', 'endemic', 'area', 'treated', 'intensity', 'modulated', 'radiation', 'therapy', 'selected', 'peer', 'reviewed', 'article', 'rd', 'international', 'conference', 'big', 'data', 'iot', 'cloud', 'computing', 'technology', 'application', 'bicta', 'daejeon', 'korea', 'november', 'mining', 'based', 'time', 'series', 'sleeping', 'pattern', 'analysis', 'life', 'big', 'data', 'kind', 'novel', 'based', 'space', 'air', 'ground', 'big', 'data', 'managing', 'default', 'risk', 'trade', 'credit', 'implement', 'big', 'data', 'analytics', 'supply', 'chain', 'digital', 'conversation', 'suicide', 'among', 'teenager', 'adult', 'epilepsy', 'big', 'data', 'machine', 'learning', 'analysis', 'effective', 'crude', 'oil', 'price', 'forecasting', 'using', 'new', 'text', 'based', 'big', 'data', 'driven', 'model', 'stakeholder', 'concern', 'air', 'pollution', 'hong', 'kong', 'policy', 'implication', 'big', 'data', 'computational', 'text', 'analysis', 'approach', 'raffi', 'krikorian', 'left', 'uber', 'help', 'dnc', 'regain', 'big', 'data', 'mojo', 'reproductive', 'medicine', 'crossroad', 'stem', 'cell', 'biology', 'big', 'data', 'novel', 'gpu', 'aware', 'histogram', 'based', 'algorithm', 'supporting', 'moving', 'object', 'segmentation', 'big', 'data', 'based', 'iot', 'application', 'scenario', 'proxy', 'expenditure', 'weight', 'consumer', 'price', 'index', 'audit', 'sampling', 'inference', 'big', 'data', 'statistic', 'modeling', 'global', 'management', 'complex', 'product', 'subject', 'data', 'oriented', 'big', 'data', 'anatomy', 'scholarly', 'collaboration', 'engineering', 'education', 'big', 'data', 'bibliometric', 'analysis', 'stochastic', 'decision', 'making', 'adaptive', 'crowdsourcing', 'medical', 'big', 'data', 'platform', 'introduction', 'special', 'issue', 'online', 'learning', 'big', 'data', 'driven', 'transportation', 'mobility', 'genomic', 'sequencing', 'assessing', 'health', 'care', 'system', 'policy', 'big', 'data', 'implication', 'efficient', 'forward', 'secure', 'lattice', 'based', 'searchable', 'encryption', 'scheme', 'big', 'data', 'era', 'commuting', 'inequity', 'determinant', 'shanghai', 'new', 'finding', 'big', 'data', 'analytics', 'androgen', 'deprivation', 'therapy', 'dementia', 'new', 'opportunity', 'challenge', 'big', 'data', 'era', 'camp', 'accurate', 'modeling', 'core', 'memory', 'locality', 'proxy', 'generation', 'big', 'data', 'application', 'symmetrical', 'compression', 'distance', 'arrhythmia', 'discrimination', 'cloud', 'based', 'big', 'data', 'service', 'cloud', 'distributed', 'gpu', 'architecture', 'pattern', 'identification', 'segmented', 'detector', 'big', 'data', 'survey', 'dual', 'drivetrain', 'model', 'digital', 'transformation', 'role', 'industrial', 'big', 'data', 'based', 'affordance', 'chew', 'singapore', 'based', 'health', 'informatics', 'company', 'mhc', 'asia', 'group', 'crunch', 'big', 'data', 'uncover', 'company', 'health', 'mhc', 'asia', 'singapore', 'real', 'time', 'business', 'activity', 'monitoring', 'analysis', 'process', 'performance', 'big', 'data', 'domain', 'analyzing', 'influencing', 'factor', 'urban', 'thermal', 'field', 'intensity', 'using', 'big', 'data', 'based', 'gi', 'predicting', 'future', 'mental', 'illness', 'social', 'medium', 'big', 'data', 'approach', 'predicting', 'congestion', 'state', 'basic', 'safety', 'message', 'using', 'big', 'data', 'graph', 'analytics', 'error', 'correction', 'anti', 'interference', 'coding', 'method', 'tracking', 'big', 'data', 'information', 'commodity', 'introduction', 'special', 'issue', 'online', 'learning', 'big', 'data', 'driven', 'transportation', 'mobility', 'siting', 'public', 'electric', 'vehicle', 'charging', 'station', 'beijing', 'using', 'big', 'data', 'informed', 'travel', 'pattern', 'taxi', 'fleet', 'aspirin', 'heart', 'disease', 'form', 'focus', 'u', 'big', 'data', 'health', 'initiative', 'leadership', 'action', 'top', 'hacker', 'behave', 'big', 'data', 'approach', 'text', 'mining', 'sentiment', 'analysis', 'competing', 'risk', 'nomogram', 'nasopharyngeal', 'carcinoma', 'intensity', 'modulated', 'radiotherapy', 'era', 'big', 'data', 'intelligence', 'platform', 'based', 'analysis', 'investor', 'short', 'stock', 'acquiring', 'firm', 'big', 'data', 'insight', 'joining', 'car', 'eva', 'style', 'analysis', 'energy', 'management', 'platform', 'micro', 'grid', 'system', 'using', 'internet', 'thing', 'big', 'data', 'technology', 'predicting', 'hospital', 'mortality', 'admission', 'medical', 'ward', 'big', 'data', 'machine', 'learning', 'model', 'pmu', 'top', 'costly', 'disease', 'china', 'big', 'data', 'database', 'analysis', 'quantifying', 'uncertainty', 'internet', 'medical', 'thing', 'big', 'data', 'service', 'using', 'intelligence', 'deep', 'learning', 'candidate', 'gene', 'identification', 'strategy', 'utilizing', 'mouse', 'human', 'big', 'data', 'mining', 'r', 'tenet', 'copd', 'genetic', 'research', 'special', 'issue', 'trend', 'high', 'performance', 'interconnection', 'network', 'exascale', 'big', 'data', 'era', 'novel', 'altered', 'k', 'mean', 'algorithm', 'clustering', 'cost', 'decrease', 'non', 'labeling', 'big', 'data', 'development', 'human', 'emotion', 'circuit', 'investigated', 'using', 'big', 'data', 'analytic', 'approach', 'stability', 'reliability', 'robustness', 'survival', 'impact', 'radiotherapy', 'interruption', 'nasopharyngeal', 'carcinoma', 'intensity', 'modulated', 'radiotherapy', 'era', 'big', 'data', 'intelligence', 'platform', 'based', 'analysis', 'demographic', 'distribution', 'hospital', 'admission', 'brain', 'arteriovenous', 'malformation', 'germany', 'estimation', 'natural', 'course', 'big', 'data', 'approach', 'granulomatosis', 'polyangiitis', 'presenting', 'acute', 'aortic', 'mitral', 'regurgitation', 'case', 'report', 'big', 'data', 'analysis', 'utility', 'maximization', 'model', 'retrieving', 'user', 'willingness', 'travel', 'participating', 'activity', 'big', 'data', 'r', 'nonvolatile', 'tcam', 'using', 'mlc', 'reram', 'frequent', 'instant', 'filter', 'iot', 'big', 'data', 'processing', 'use', 'crowdfunding', 'social', 'medium', 'platform', 'strategic', 'start', 'communication', 'big', 'data', 'analysis', 'e', 'government', 'recommendation', 'algorithm', 'based', 'probabilistic', 'semantic', 'cluster', 'analysis', 'combination', 'improved', 'collaborative', 'filtering', 'big', 'data', 'environment', 'government', 'affair', 'improved', 'svm', 'rfe', 'based', 'intensity', 'dependent', 'normalization', 'feature', 'selection', 'gene', 'expression', 'big', 'data', 'opportunity', 'energy', 'efficient', 'computing', 'study', 'inexact', 'general', 'purpose', 'processor', 'high', 'performance', 'big', 'data', 'application', 'molecular', 'revolution', 'cutaneous', 'biology', 'era', 'genome', 'wide', 'association', 'study', 'statistical', 'big', 'data', 'computational', 'topic', 'evolution', 'regional', 'low', 'carbon', 'innovation', 'system', 'sustainable', 'development', 'empirical', 'study', 'big', 'data', 'reram', 'based', 'r', 'nonvolatile', 'tcam', 'using', 'rc', 'filtered', 'stress', 'decoupled', 'scheme', 'frequent', 'instant', 'search', 'engine', 'used', 'iot', 'big', 'data', 'processing', 'hepatitis', 'b', 'virus', 'screening', 'reactivation', 'management', 'patient', 'nasopharyngeal', 'carcinoma', 'large', 'scale', 'big', 'data', 'intelligence', 'platform', 'based', 'analysis', 'endemic', 'area', 'scheduling', 'instruction', 'enhanced', 'performance', 'energy', 'efficiency', 'clustered', 'architecture', 'application', 'big', 'data', 'sensor', 'network', 'optimizing', 'induction', 'chemotherapy', 'regimen', 'patient', 'locoregionally', 'advanced', 'nasopharyngeal', 'carcinoma', 'big', 'data', 'intelligence', 'platform', 'based', 'analysis', 'pr', 'cost', 'analysis', 'current', 'situation', 'pneumonia', 'treatment', 'china', 'big', 'data', 'hospital', 'information', 'system', 'database', 'analysis', 'environmental', 'impact', 'characterization', 'packaging', 'waste', 'generated', 'urban', 'food', 'delivery', 'service', 'big', 'data', 'analysis', 'jing', 'jin', 'ji', 'region', 'china', 'small', 'batch', 'size', 'convolutional', 'neural', 'network', 'based', 'fault', 'diagnosis', 'system', 'nuclear', 'energy', 'production', 'safety', 'big', 'data', 'environment', 'low', 'alt', 'value', 'amongst', 'hospitalized', 'patient', 'associated', 'increased', 'risk', 'hypoglycemia', 'overall', 'mortality', 'retrospective', 'big', 'data', 'analysis', 'patient', 'clinical', 'feature', 'survival', 'outcome', 'ascending', 'descending', 'type', 'nasopharyngeal', 'carcinoma', 'intensity', 'modulated', 'radiotherapy', 'era', 'big', 'data', 'intelligence', 'platform', 'based', 'analysis', 'agricultural', 'cropland', 'extent', 'area', 'south', 'asia', 'derived', 'using', 'landsat', 'satellite', 'time', 'series', 'big', 'data', 'using', 'random', 'forest', 'machine', 'learning', 'algorithm', 'google', 'earth', 'engine', 'cloud', 'corporate', 'social', 'responsibility', 'green', 'supply', 'chain', 'management', 'firm', 'performance', 'moderating', 'role', 'big', 'data', 'analytics', 'capability', 'impact', 'covid', 'pandemic', 'user', 'behavior', 'environmental', 'benefit', 'bike', 'sharing', 'big', 'data', 'analysis', 'sleep', 'screen', 'exposure', 'across', 'beginning', 'life', 'deciphering', 'link', 'using', 'big', 'data', 'analytics', 'detection', 'onset', 'agitation', 'patient', 'dementia', 'real', 'time', 'monitoring', 'application', 'big', 'data', 'solution', 'correlation', 'total', 'solar', 'irradiance', 'glycated', 'hemoglobin', 'month', 'later', 'patient', 'diabetes', 'big', 'data', 'analysis', 'benchmark', 'approach', 'toolkit', 'online', 'scheduling', 'multiple', 'deadline', 'constrained', 'workflow', 'big', 'data', 'processing', 'system', 'review', 'modern', 'cryptography', 'world', 'war', 'ii', 'era', 'big', 'data', 'era', 'determinant', 'role', 'gender', 'age', 'sii', 'plr', 'nlr', 'lmr', 'mlr', 'reference', 'interval', 'defining', 'henan', 'china', 'posteriori', 'big', 'data', 'based', 'big', 'data', 'algorithm', 'knn', 'pls', 'toward', 'business', 'process', 'innovation', 'big', 'data', 'era', 'mediating', 'role', 'big', 'data', 'knowledge', 'management', 'overcoming', 'resistance', 'big', 'data', 'operational', 'change', 'interactive', 'data', 'visualization', 'furthest', 'pair', 'based', 'binary', 'search', 'tree', 'speeding', 'big', 'data', 'classification', 'using', 'k', 'nearest', 'neighbor', 'stgi', 'spatio', 'temporal', 'grid', 'index', 'model', 'marine', 'big', 'data', 'call', 'paper', 'special', 'issue', 'big', 'data', 'robotics', 'tv', 'audience', 'measurement', 'big', 'data', 'correction', 'big', 'data', 'unsupervised', 'tensor', 'mining', 'big', 'data', 'practitioner', 'soom', 'sort', 'based', 'optimizer', 'big', 'data', 'multi', 'query', 'interview', 'dr', 'silvio', 'carta', 'author', 'book', 'big', 'data', 'code', 'discrete', 'city', 'routledge', 'optimizing', 'sentinel', 'image', 'selection', 'big', 'data', 'context', 'new', 'discrimination', 'diagram', 'basalt', 'based', 'big', 'data', 'research', 'comparative', 'study', 'big', 'data', 'global', 'adakites', 'mineralization', 'related', 'granite', 'geza', 'arc', 'metallogenic', 'belt', 'northwest', 'yunnan', 'southwest', 'china', 'internet', 'thing', 'based', 'optimized', 'routing', 'big', 'data', 'gathering', 'system', 'landslide', 'detection', 'common', 'big', 'data', 'challenge', 'overcome', 'recent', 'advance', 'earth', 'observation', 'big', 'data', 'hydrology', 'big', 'data', 'guide', 'understanding', 'climate', 'change', 'case', 'theory', 'guided', 'data', 'science', 'new', 'big', 'data', 'approach', 'based', 'geoecological', 'information', 'modeling', 'system', 'big', 'data', 'enabler', 'primary', 'education', 'call', 'paper', 'special', 'issue', 'big', 'data', 'robotics', 'big', 'data', 'predictive', 'analytics', 'health', 'care', 'big', 'data', 'big', 'deal', 'big', 'data', 'rise', 'machine', 'financial', 'market', 'call', 'special', 'issue', 'paper', 'big', 'data', 'business', 'call', 'special', 'issue', 'paper', 'big', 'data', 'business', 'big', 'data', 'efficient', 'market', 'end', 'daily', 'fantasy', 'sport', 'know', 'big', 'uncertain', 'data', 'multiple', 'sensor', 'efficient', 'processing', 'high', 'order', 'multi', 'hypothesis', 'evidence', 'theoretic', 'approach', 'us', 'big', 'data', 'city', 'big', 'data', 'analytics', 'data', 'management', 'perspective', 'public', 'administration', 'deep', 'learning', 'big', 'sparse', 'behavioral', 'data', 'comparison', 'scalability', 'batch', 'big', 'data', 'processing', 'apache', 'spark', 'apache', 'flink', 'ensemble', 'based', 'scalable', 'approach', 'intrusion', 'detection', 'using', 'big', 'data', 'framework', 'exercise', 'exploring', 'big', 'data', 'producing', 'reliable', 'statistical', 'information', 'big', 'data', 'cataclysm', 'catalyst', 'combining', 'human', 'computing', 'machine', 'learning', 'make', 'sense', 'big', 'aerial', 'data', 'disaster', 'response', 'big', 'data', 'ensemble', 'clinical', 'prediction', 'healthcare', 'data', 'using', 'deep', 'learning', 'model', 'big', 'data', 'analytics', 'capability', 'digital', 'transformation', 'insurance', 'sector', 'curating', 'big', 'data', 'made', 'simple', 'perspective', 'scientific', 'community', 'health', 'big', 'data', 'analytics', 'current', 'perspective', 'challenge', 'potential', 'solution', 'agile', 'big', 'data', 'analytics', 'web', 'based', 'system', 'architecture', 'centric', 'approach', 'scalable', 'big', 'data', 'modelling', 'analyzing', 'big', 'smart', 'metering', 'data', 'towards', 'differentiated', 'user', 'service', 'sublinear', 'approach', 'clustering', 'big', 'spatiotemporal', 'interval', 'data', 'intelligent', 'big', 'data', 'analysis', 'review', 'intelligent', 'big', 'data', 'analysis', 'review', 'big', 'data', 'analytics', 'framework', 'border', 'crossing', 'transportation', 'leveraging', 'big', 'data', 'analytics', 'case', 'kenyan', 'telecom', 'health', 'big', 'data', 'analytics', 'current', 'perspective', 'challenge', 'potential', 'solution', 'improved', 'big', 'data', 'stock', 'index', 'prediction', 'using', 'deep', 'learning', 'cnn', 'gru', 'guest', 'editorial', 'big', 'data', 'analytics', 'web', 'speed', 'big', 'data', 'analytics', 'unveiling', 'storage', 'distribution', 'sub', 'datasets', 'guest', 'editorial', 'big', 'data', 'infrastructure', 'guest', 'editorial', 'big', 'data', 'analytics', 'web', 'guest', 'editorial', 'big', 'scholar', 'data', 'discovery', 'collaboration', 'guest', 'editorial', 'big', 'scholar', 'data', 'discovery', 'collaboration', 'towards', 'quality', 'service', 'driven', 'consistency', 'big', 'data', 'management', 'review', 'ethical', 'concern', 'big', 'data', 'management', 'fuzzy', 'based', 'scalable', 'clustering', 'algorithm', 'handling', 'big', 'data', 'using', 'apache', 'spark', 'guest', 'editorial', 'special', 'issue', 'big', 'scholar', 'data', 'discovery', 'collaboration', 'continued', 'concentric', 'framework', 'leveraging', 'big', 'data', 'business', 'value', 'deduplication', 'encrypted', 'big', 'data', 'cloud', 'online', 'similarity', 'learning', 'big', 'data', 'overfitting', 'index', 'ieee', 'transaction', 'big', 'data', 'vol', 'enhanced', 'visualization', 'method', 'aid', 'behavioral', 'trajectory', 'pattern', 'recognition', 'infrastructure', 'big', 'longitudinal', 'data', 'hdm', 'composable', 'framework', 'big', 'data', 'processing', 'survey', 'development', 'context', 'aware', 'monitoring', 'healthcare', 'big', 'data', 'distributed', 'feature', 'selection', 'efficient', 'economic', 'big', 'data', 'analysis', 'algorithmseer', 'system', 'extracting', 'searching', 'algorithm', 'scholarly', 'big', 'data', 'taxi', 'passenger', 'demand', 'modeling', 'based', 'big', 'data', 'roving', 'sensor', 'network', 'towards', 'max', 'min', 'fair', 'resource', 'allocation', 'stream', 'big', 'data', 'analytics', 'shared', 'cloud', 'thermal', 'aware', 'dvfs', 'enabled', 'big', 'data', 'task', 'scheduling', 'data', 'center', 'structural', 'balance', 'theory', 'based', 'e', 'commerce', 'recommendation', 'big', 'rating', 'data', 'resting', 'state', 'fmri', 'functional', 'connectivity', 'big', 'data', 'preprocessing', 'pipeline', 'topological', 'data', 'analysis', 'hybridisation', 'classifier', 'anomaly', 'detection', 'big', 'data', 'extended', 'spatio', 'temporal', 'granger', 'causality', 'model', 'air', 'quality', 'estimation', 'heterogeneous', 'urban', 'big', 'data', 'platform', 'big', 'data', 'analytics', 'distributed', 'scale', 'storage', 'system', 'large', 'scale', 'spectral', 'clustering', 'managing', 'big', 'data', 'healthcare', 'operation', 'hierarchical', 'distributed', 'processing', 'framework', 'big', 'image', 'data', 'nosql', 'database', 'big', 'data', 'petuum', 'new', 'platform', 'distributed', 'machine', 'learning', 'big', 'data', 'towards', 'real', 'time', 'big', 'data', 'analytics', 'platform', 'health', 'application', 'towards', 'quality', 'service', 'driven', 'consistency', 'big', 'data', 'management', 'current', 'trend', 'predictive', 'analytics', 'big', 'data', 'parallel', 'computing', 'preserving', 'privacy', 'using', 'k', 'anonymisation', 'algorithm', 'big', 'data', 'toward', 'efficient', 'flexible', 'metadata', 'indexing', 'big', 'data', 'system', 'index', 'ieee', 'transaction', 'big', 'data', 'vol', 'platform', 'big', 'data', 'analytics', 'distributed', 'scale', 'storage', 'system', 'secure', 'multi', 'owner', 'based', 'cloud', 'computing', 'scheme', 'big', 'data', 'classification', 'comparison', 'nosql', 'big', 'data', 'model', 'survey', 'big', 'data', 'processing', 'infrastructure', 'evolving', 'role', 'fpga', 'palopro', 'platform', 'knowledge', 'extraction', 'big', 'social', 'data', 'news', 'five', 'layer', 'architecture', 'big', 'data', 'processing', 'analytics', 'corporate', 'governance', 'fraud', 'detection', 'annual', 'report', 'using', 'big', 'data', 'analytics', 'smart', 'monitoring', 'camera', 'driven', 'intelligent', 'processing', 'big', 'surveillance', 'video', 'data', 'kvasir', 'scalable', 'provision', 'semantically', 'relevant', 'web', 'content', 'big', 'data', 'framework', 'optimized', 'deep', 'learning', 'eeg', 'big', 'data', 'seizure', 'prediction', 'bci', 'via', 'internet', 'thing', 'dip', 'svm', 'distribution', 'preserving', 'kernel', 'support', 'vector', 'machine', 'big', 'data', 'perception', 'independent', 'financial', 'advisor', 'usefulness', 'big', 'data', 'context', 'decision', 'making', 'uk', 'fast', 'approach', 'semantic', 'service', 'composition', 'big', 'data', 'environment', 'availability', 'modelling', 'assurance', 'big', 'data', 'computing']

데이터 탐색 및 분석 모델 구축

count = Counter(words2)
count   #출력하여 내용 확인
Counter({'big': 1116,
         'data': 1240,
         'based': 96,
         'platform': 32,
         'worker': 3,
         'behavior': 9,
         'observation': 6,
         'field': 5,
         'using': 54,
         'surface': 2,
         'fitting': 2,
         'improve': 10,
         'aircraft': 2,
         'safety': 10,
         'study': 17,
         'relationship': 7,
         'anomaly': 3,
         'method': 21,
         'predict': 2,
         'target': 4,
         'small': 18,
         'molecule': 1,
         'accelerate': 1,
         'drug': 8,
         'discovery': 13,
         'social': 17,
         'content': 4,
         'dissemination': 2,
         'internet': 18,
         'vehicle': 6,
         'analysis': 89,
         'disaster': 8,
         'information': 22,
         'south': 2,
         'korea': 2,
         'real': 20,
         'world': 20,
         'laboratory': 2,
         'medicine': 10,
         'current': 8,
         'status': 5,
         'application': 44,
         'future': 18,
         'consideration': 3,
         'applying': 4,
         'technology': 26,
         'network': 24,
         'architecture': 11,
         'ecology': 4,
         'approach': 55,
         'lead': 1,
         'increased': 2,
         'understanding': 11,
         'animal': 1,
         'movement': 1,
         'persisting': 1,
         'nosql': 4,
         'landscape': 2,
         'political': 7,
         'rhetoric': 1,
         'relating': 1,
         'development': 18,
         'united': 5,
         'state': 12,
         'china': 18,
         'global': 9,
         'power': 6,
         'thematic': 1,
         'business': 25,
         'ethic': 3,
         'security': 15,
         'mechanism': 2,
         'fully': 1,
         'homomorphic': 1,
         'encryption': 2,
         'cubic': 1,
         'spline': 1,
         'curve': 1,
         'public': 9,
         'key': 4,
         'cryptography': 2,
         'perspective': 26,
         'ai': 4,
         'newton': 1,
         'merton': 1,
         'analytics': 108,
         'intelligence': 21,
         'spatial': 8,
         'temporal': 3,
         'bovine': 1,
         'tuberculosis': 1,
         'wildlife': 1,
         'badger': 1,
         'cattle': 1,
         'company': 8,
         'enter': 1,
         'classroom': 1,
         'pioneer': 1,
         'issue': 48,
         'warning': 2,
         'model': 45,
         'multi': 11,
         'modal': 1,
         'transportation': 8,
         'macroscopic': 1,
         'control': 11,
         'optimisation': 1,
         'oriented': 5,
         'recommendation': 6,
         'objective': 1,
         'optimization': 22,
         'sears': 1,
         'strategy': 10,
         'service': 21,
         'call': 31,
         'away': 1,
         'job': 2,
         'outlook': 1,
         'encouraging': 1,
         'gm': 1,
         'plan': 1,
         'hire': 1,
         'hp': 1,
         'employee': 2,
         'dynamic': 9,
         'broadcast': 1,
         'fat': 1,
         'tree': 3,
         'center': 9,
         'mobile': 6,
         'iot': 11,
         'device': 8,
         'becoming': 4,
         'savvy': 2,
         'bifm': 1,
         'driven': 67,
         'intelligent': 13,
         'forecasting': 6,
         'covid': 6,
         'distributed': 22,
         'via': 8,
         'blockwise': 1,
         'gradient': 1,
         'tracking': 4,
         'industrial': 8,
         'cps': 1,
         'adaptive': 5,
         'production': 5,
         'scheduling': 10,
         'smart': 28,
         'manufacturing': 6,
         'wireless': 6,
         'use': 21,
         'case': 12,
         'solution': 12,
         'trend': 9,
         'streetlytics': 1,
         'estate': 1,
         'market': 10,
         'rating': 3,
         'framework': 35,
         'measuring': 1,
         'human': 11,
         'mobility': 5,
         'app': 1,
         'usage': 1,
         'analyzing': 6,
         'modeling': 15,
         'non': 5,
         'pharmaceutical': 4,
         'intervention': 1,
         'pandemic': 3,
         'recent': 4,
         'struggle': 1,
         'birthing': 1,
         'pain': 4,
         'researcher': 2,
         'say': 2,
         'selbstlernende': 1,
         'analysen': 1,
         'f': 5,
         'r': 9,
         'intelligentes': 1,
         'energiemanagement': 1,
         'capturing': 2,
         'visitor': 1,
         'activity': 3,
         'flow': 5,
         'island': 1,
         'country': 1,
         'research': 27,
         'mining': 26,
         'visualization': 8,
         'tensor': 4,
         'routing': 2,
         'heterogeneous': 6,
         'comprehensive': 3,
         'monitoring': 12,
         'system': 59,
         'yield': 2,
         'enhancement': 1,
         'semiconductor': 1,
         'cloud': 30,
         'unmanned': 2,
         'ground': 3,
         'improving': 3,
         'optimizing': 4,
         'read': 1,
         'editorial': 15,
         'advanced': 6,
         'tool': 4,
         'high': 12,
         'throughput': 2,
         'omics': 1,
         'traditional': 3,
         'v': 4,
         'fashion': 1,
         'examination': 1,
         'wgsn': 1,
         'edited': 2,
         'partitioned': 1,
         'asynchronous': 1,
         'dual': 2,
         'decomposition': 2,
         'lesson': 4,
         'project': 5,
         'four': 1,
         'health': 33,
         'illustrate': 1,
         'potential': 9,
         'shared': 2,
         'national': 5,
         'multipurpose': 1,
         'ibm': 1,
         'stellt': 1,
         'au': 1,
         'twitter': 4,
         'vor': 1,
         'leveraging': 9,
         'revolution': 12,
         'cm': 1,
         'expanding': 3,
         'capability': 9,
         'spur': 1,
         'transformation': 7,
         'st': 1,
         'century': 1,
         'investing': 1,
         'soaring': 1,
         'despite': 1,
         'price': 4,
         'still': 2,
         'possible': 1,
         'cash': 1,
         'advancing': 2,
         'conceptual': 4,
         'symbiosis': 1,
         'exploring': 8,
         'waste': 2,
         'stream': 6,
         'ari': 1,
         'caroline': 1,
         'guru': 1,
         'mine': 3,
         'help': 6,
         'cancer': 5,
         'patient': 12,
         'resource': 6,
         'career': 2,
         'methodology': 4,
         'time': 19,
         'sustainability': 1,
         'city': 8,
         'towards': 11,
         'inferencing': 1,
         'conex': 1,
         'efficient': 18,
         'exploration': 1,
         'configuration': 1,
         'better': 3,
         'performance': 20,
         'daten': 1,
         'und': 6,
         'diabetes': 2,
         'medizin': 1,
         'edge': 3,
         'computing': 26,
         'enabled': 11,
         'electric': 4,
         'charging': 3,
         'accountability': 1,
         'algorithm': 32,
         'enhancing': 2,
         'physical': 4,
         'layer': 2,
         'aided': 1,
         'hybrid': 2,
         'relay': 1,
         'selection': 8,
         'depression': 1,
         'multimethod': 1,
         'investigation': 3,
         'american': 1,
         'emotional': 1,
         'response': 3,
         'trump': 1,
         'presidency': 1,
         'aligning': 1,
         'organizational': 3,
         'goal': 12,
         'semantic': 6,
         'interoperability': 2,
         'infrastructure': 12,
         'healthcare': 14,
         'middleware': 1,
         'managing': 6,
         'pipelined': 1,
         'gpu': 3,
         'mapreduce': 8,
         'processing': 27,
         'evaluation': 5,
         'formalism': 1,
         'pmu': 2,
         'medical': 14,
         'evidence': 9,
         'generation': 9,
         'practice': 10,
         'post': 1,
         'marketing': 6,
         'jbhi': 2,
         'special': 36,
         'situating': 1,
         'science': 34,
         'advance': 5,
         'challenge': 43,
         'integrative': 1,
         'biology': 4,
         'centrism': 1,
         'mapping': 6,
         'bioscience': 1,
         'w': 2,
         'matrix': 3,
         'comparative': 3,
         'scientific': 9,
         'image': 5,
         'workload': 2,
         'facebook': 1,
         'fiasco': 1,
         'rethink': 2,
         'abnormal': 2,
         'operation': 5,
         'gram': 1,
         'schmidt': 1,
         'orthogonalization': 1,
         'n': 1,
         'propyl': 1,
         'propionate': 1,
         'simulated': 1,
         'moving': 6,
         'bed': 1,
         'reactor': 1,
         'israeli': 1,
         'firm': 6,
         'medaware': 1,
         'offer': 5,
         'software': 5,
         'detect': 2,
         'prescription': 2,
         'error': 6,
         'graduate': 2,
         'program': 1,
         'analyst': 1,
         'cover': 2,
         'era': 31,
         'putting': 3,
         'pressure': 2,
         'storage': 9,
         'backwards': 1,
         'gwas': 1,
         'smallholder': 2,
         'need': 6,
         'access': 5,
         'agronomy': 1,
         'ultralow': 1,
         'memory': 6,
         'nonvolatile': 3,
         'domain': 2,
         'wall': 1,
         'nanowire': 1,
         'design': 8,
         'implementation': 4,
         'spark': 5,
         'distribution': 10,
         'interception': 1,
         'seeking': 2,
         'alpha': 1,
         'dividend': 2,
         'announcement': 1,
         'insight': 8,
         'joining': 2,
         'car': 5,
         'eva': 2,
         'style': 2,
         'interview': 3,
         'mit': 3,
         'stefan': 1,
         'wrobel': 1,
         'zum': 3,
         'thema': 2,
         'angewandte': 1,
         'forschung': 1,
         'stochastic': 4,
         'approximation': 5,
         'statistical': 8,
         'origin': 1,
         'multidisciplinary': 1,
         'code': 3,
         'offloading': 1,
         'scheme': 6,
         'android': 1,
         'fast': 3,
         'relaxation': 1,
         'process': 21,
         'page': 1,
         'phylogenomics': 1,
         'principle': 1,
         'opportunity': 29,
         'pitfall': 3,
         'phylogenetics': 2,
         'deadline': 4,
         'aware': 6,
         'flexible': 3,
         'bandwidth': 1,
         'allocation': 5,
         'transfer': 2,
         'ad': 2,
         'exec': 1,
         'draper': 1,
         'education': 9,
         'grand': 3,
         'convergence': 2,
         'fitnessgram': 1,
         'digital': 15,
         'commerce': 3,
         'child': 3,
         'vergleich': 1,
         'von': 2,
         'kompetenzanforderungen': 1,
         'spezialisten': 1,
         'starting': 2,
         'think': 7,
         'clinical': 8,
         'detection': 12,
         'early': 1,
         'stage': 1,
         'bipolar': 1,
         'disorder': 1,
         'assessment': 9,
         'learning': 46,
         'pwr': 1,
         'pin': 1,
         'homogenized': 1,
         'cross': 2,
         'section': 1,
         'x': 3,
         'c': 3,
         'qualitative': 3,
         'readiness': 1,
         'requirement': 1,
         'enterprise': 8,
         'investment': 1,
         'economics': 1,
         'econophysics': 1,
         'fine': 3,
         'grained': 1,
         'predicting': 7,
         'pattern': 10,
         'large': 14,
         'scale': 13,
         'triadic': 1,
         'patent': 1,
         'kidney': 1,
         'disease': 9,
         'ck': 1,
         'net': 1,
         'dream': 2,
         'symposium': 1,
         'insure': 1,
         'tech': 1,
         'digitalization': 1,
         'technique': 11,
         'risk': 13,
         'management': 34,
         'insurance': 2,
         'note': 1,
         'hci': 1,
         'multimedia': 5,
         'randomized': 3,
         'block': 1,
         'proximal': 1,
         'kernel': 5,
         'sieben': 1,
         'schritten': 1,
         'erfolgreichen': 1,
         'projekt': 1,
         'message': 4,
         'agenda': 2,
         'setting': 1,
         'ownership': 3,
         'architecting': 1,
         'critical': 6,
         'computation': 3,
         'migration': 1,
         'new': 40,
         'execute': 1,
         'bioinformatics': 2,
         'workflow': 3,
         'trust': 1,
         'crowdfunding': 2,
         'campaign': 2,
         'theoretical': 6,
         'underpinnings': 1,
         'correction': 4,
         'predictive': 8,
         'bigger': 4,
         'really': 2,
         'junqu': 1,
         'de': 2,
         'fortuny': 1,
         'e': 7,
         'marten': 1,
         'provost': 1,
         'training': 3,
         'pedagogy': 1,
         'value': 11,
         'biosurveillance': 1,
         'natural': 2,
         'capital': 2,
         'accounting': 2,
         'diversity': 2,
         'review': 23,
         'scalable': 14,
         'quality': 10,
         'pythia': 1,
         'handling': 5,
         'missing': 1,
         'structural': 3,
         'consequence': 2,
         'bring': 1,
         'beyond': 3,
         'hype': 2,
         'role': 12,
         'teamwork': 1,
         'visual': 3,
         'box': 1,
         'office': 1,
         'prediction': 15,
         'scholarly': 5,
         'survey': 20,
         'boe': 1,
         'fusing': 1,
         'spanish': 1,
         'official': 1,
         'gazette': 1,
         'sport': 4,
         'toward': 6,
         'next': 9,
         'frontier': 4,
         'water': 4,
         'cycle': 1,
         'remote': 3,
         'sensing': 7,
         'overview': 3,
         'artificial': 4,
         'fusion': 5,
         'context': 10,
         'crown': 1,
         'epidemic': 1,
         'idea': 2,
         'geological': 1,
         'decision': 16,
         'making': 17,
         'author': 2,
         'gong': 1,
         'comment': 1,
         'computational': 8,
         'propaganda': 3,
         'paper': 30,
         'futuristic': 3,
         'fabric': 3,
         'societal': 3,
         'multiple': 3,
         'measurement': 9,
         'extended': 3,
         'binary': 2,
         'phase': 4,
         'shift': 2,
         'keying': 1,
         'transceiver': 1,
         'persona': 1,
         'shape': 1,
         'demographic': 2,
         'representation': 1,
         'user': 4,
         'segment': 2,
         'essential': 1,
         'urban': 7,
         'land': 2,
         'category': 1,
         'euluc': 1,
         'geospatial': 4,
         'progress': 2,
         'rf': 1,
         'localisation': 1,
         'search': 7,
         'rescue': 2,
         'guest': 12,
         'medium': 14,
         'part': 2,
         'biomedical': 3,
         'accurate': 4,
         'cluster': 5,
         'competitive': 6,
         'k': 8,
         'mean': 8,
         'privacy': 12,
         'secdedoop': 1,
         'secure': 3,
         'deduplication': 3,
         'hdfs': 1,
         'hadoop': 4,
         'environment': 12,
         'cognitive': 4,
         'green': 4,
         'space': 5,
         'actual': 1,
         'utilization': 4,
         'deep': 12,
         'assisted': 4,
         'earth': 24,
         'engagement': 1,
         'stakeholder': 3,
         'hungary': 1,
         'support': 11,
         'sdgs': 1,
         'atmospheric': 1,
         'ecosystem': 2,
         'providing': 1,
         'contribution': 1,
         'reaching': 1,
         'nation': 1,
         'sustainable': 17,
         'strength': 1,
         'number': 2,
         'simplify': 1,
         'sentiment': 4,
         'classification': 9,
         'lost': 2,
         'bee': 1,
         'colony': 1,
         'reinforced': 1,
         'kalman': 1,
         'filter': 3,
         'localization': 2,
         'thing': 16,
         'blending': 1,
         'finding': 4,
         'position': 3,
         'reference': 3,
         'node': 1,
         'labor': 2,
         'economist': 1,
         'get': 2,
         'microscope': 1,
         'agricultural': 5,
         'population': 4,
         'monitor': 1,
         'celebration': 2,
         'establishment': 3,
         'international': 6,
         'cbas': 2,
         'editor': 3,
         'chief': 1,
         'understand': 1,
         'condition': 2,
         'kavli': 1,
         'oceanography': 1,
         'continued': 3,
         'machine': 26,
         'engineering': 12,
         'rst': 1,
         'rough': 1,
         'set': 4,
         'differential': 1,
         'evolution': 5,
         'feature': 6,
         'biological': 1,
         'cybersecurity': 3,
         'behaviour': 1,
         'swimming': 1,
         'pool': 1,
         'impact': 15,
         'household': 1,
         'intensity': 7,
         'san': 1,
         'antonio': 1,
         'texas': 1,
         'exploiting': 2,
         'reused': 1,
         'sharing': 7,
         'work': 10,
         'multiquery': 1,
         'flink': 2,
         'robotics': 3,
         'literature': 5,
         'analyze': 2,
         'stock': 4,
         'dilemma': 1,
         'behavioral': 3,
         'ensemble': 4,
         'complex': 3,
         'iotbds': 1,
         'complexis': 1,
         'pervasive': 1,
         'care': 11,
         'programming': 3,
         'mr': 2,
         'dp': 1,
         'priority': 1,
         'u': 9,
         'army': 2,
         'person': 1,
         'event': 1,
         'military': 1,
         'civilian': 1,
         'monkeyking': 1,
         'parameter': 1,
         'tuning': 1,
         'reinforcement': 1,
         'soft': 2,
         'moth': 2,
         'flame': 2,
         'bat': 2,
         'map': 4,
         'reduce': 2,
         'clustering': 10,
         'sparse': 4,
         'fuzzy': 6,
         'bibliometric': 4,
         'relevant': 2,
         'remotely': 1,
         'sensed': 1,
         'ocean': 2,
         'polar': 1,
         'region': 4,
         'protecting': 1,
         'virtualized': 1,
         'unstructured': 2,
         'analytical': 2,
         'perception': 3,
         'good': 6,
         'optimal': 2,
         'firefly': 1,
         'lion': 1,
         'online': 15,
         'guangzhou': 1,
         'restaurant': 1,
         'school': 4,
         'relate': 1,
         'little': 11,
         'coordination': 1,
         'medicare': 1,
         'beneficiary': 1,
         'medigap': 1,
         'coverage': 2,
         'drive': 2,
         'introducing': 1,
         'perform': 1,
         'gsio': 1,
         'programmatic': 1,
         'interface': 2,
         'delivering': 1,
         'view': 5,
         'cube': 1,
         'semantics': 4,
         'ontology': 3,
         'knowledge': 12,
         'graph': 5,
         'core': 3,
         'processor': 4,
         'many': 1,
         'co': 2,
         'supercomputer': 1,
         'beacon': 1,
         'geocomputation': 1,
         'geo': 2,
         'simulation': 3,
         'first': 3,
         'experience': 3,
         'timescan': 1,
         'chain': 6,
         'climate': 2,
         'change': 3,
         'sentinel': 2,
         'landsat': 2,
         'imagery': 1,
         'settlement': 1,
         'stewardship': 1,
         'geosocial': 1,
         'crowdsourced': 1,
         'vector': 3,
         'generalized': 3,
         'supervised': 2,
         'produce': 1,
         'provincial': 1,
         'wetland': 1,
         'inventory': 1,
         'google': 4,
         'engine': 3,
         'facilitates': 1,
         'evaluating': 5,
         'partnership': 1,
         'increasing': 1,
         'australian': 1,
         'way': 6,
         'web': 8,
         'open': 5,
         'innovative': 1,
         'transport': 2,
         'pipeline': 3,
         'greater': 1,
         'sydney': 1,
         'achieving': 1,
         'belt': 2,
         'road': 1,
         'wechat': 1,
         'blood': 2,
         'donation': 2,
         'simd': 1,
         'parallel': 7,
         'mcmc': 1,
         'sampling': 4,
         'bayesian': 4,
         'educational': 2,
         'shuffle': 1,
         'biggest': 3,
         'repository': 2,
         'balloon': 1,
         'unprecedented': 1,
         'level': 2,
         'coping': 3,
         'sector': 6,
         'transforming': 6,
         'nearest': 4,
         'neighbor': 3,
         'obtain': 1,
         'meet': 6,
         'formal': 1,
         'practical': 4,
         'novel': 9,
         'paradigm': 4,
         'organisation': 2,
         'leverage': 3,
         'maturity': 1,
         'quantile': 1,
         'regression': 2,
         'divide': 2,
         'conquer': 1,
         'financial': 10,
         'double': 1,
         'log': 2,
         'skew': 1,
         'symmetric': 1,
         'result': 2,
         'output': 1,
         'attribute': 1,
         'designing': 2,
         'operating': 1,
         'discretization': 1,
         'taxonomy': 1,
         'always': 1,
         'benefit': 5,
         'geographical': 1,
         'build': 2,
         'em': 2,
         'automated': 1,
         'slicing': 1,
         'validation': 1,
         'integration': 4,
         'theme': 1,
         'crime': 1,
         'detailed': 1,
         'prominent': 1,
         'compression': 5,
         'used': 5,
         'reducing': 2,
         'volume': 2,
         'recombine': 1,
         'complexity': 1,
         'electrophysiology': 2,
         'collection': 2,
         'haery': 1,
         'query': 3,
         'accumulative': 1,
         'dimensional': 3,
         'gamma': 1,
         'summarize': 1,
         'dense': 1,
         'experimental': 2,
         'kansei': 1,
         'hotel': 1,
         'criminal': 2,
         'done': 1,
         'portfolio': 2,
         'successful': 1,
         'swarm': 3,
         'single': 2,
         'cell': 4,
         'transcriptomics': 1,
         'five': 3,
         'cyber': 2,
         'acquired': 1,
         'multichannel': 1,
         'sensor': 7,
         'active': 1,
         'grid': 6,
         'industry': 16,
         'academic': 2,
         'librarian': 4,
         'competency': 2,
         'skill': 2,
         'library': 1,
         'chicken': 1,
         'foraging': 1,
         'belief': 2,
         'classifier': 3,
         'entity': 1,
         'communication': 6,
         'peer': 2,
         'produced': 1,
         'implicit': 2,
         'collaboration': 6,
         'wikipedia': 1,
         'apache': 4,
         'revised': 1,
         'pyramid': 1,
         'conference': 2,
         'bigcomp': 1,
         'improves': 1,
         'michael': 2,
         'buck': 2,
         'talk': 1,
         'living': 1,
         'bioinformaticist': 1,
         'emerging': 3,
         'alternative': 2,
         'warehouse': 4,
         'partner': 1,
         'allina': 1,
         'team': 5,
         'catalyst': 2,
         'garner': 1,
         'dimensionality': 1,
         'reduction': 3,
         'problem': 11,
         'sdn': 1,
         'coal': 1,
         'fired': 1,
         'plant': 1,
         'fundamental': 1,
         'mongodb': 1,
         'modular': 1,
         'building': 7,
         'deal': 5,
         'heavy': 1,
         'tailed': 1,
         'rule': 4,
         'granularity': 1,
         'phone': 1,
         'compute': 2,
         'scenario': 4,
         'retail': 1,
         'apriori': 1,
         'straggler': 1,
         'rescheduling': 1,
         'slow': 1,
         'running': 1,
         'task': 2,
         'lfcso': 1,
         'lvq': 1,
         'enhanced': 4,
         'pso': 1,
         'subdata': 1,
         'unlocking': 1,
         'causal': 2,
         'relation': 2,
         'barrier': 2,
         'skyline': 1,
         'telecom': 2,
         'fraud': 2,
         ...})
         
word_count = dict()

for tag, counts in count.most_common(50):
    if(len(str(tag))>1):
        word_count[tag] = counts
        print("%s : %d % (tag, counts")
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts
%s : %d % (tag, counts

sorted_Keys = sorted(word_count, key = word_count.get, reverse = True)
sorted_Values = sorted(word_count.values(), reverse = True)
plt.bar(range(len(word_count)), sorted_Values, align = 'center')
plt.xticks(range(len(word_count)), list(sorted_Keys), rotation = '85')
plt.show
<function matplotlib.pyplot.show(*args, **kw)>

결과 시각화

all_files_data_concat['doc_count'] = 0
summary_year = all_files_data_concat.groupby('출판일', as_index = False)['doc_count'].count()
summary_year    #출력하여 내용 화인

plt.figure(figsize = (12, 5))
plt.xlabel("year")
plt.ylabel("doc-count")
plt.grid(True)
plt.plot(range(len(summary_year)), summary_year['doc_count'])
plt.xticks(range(len(summary_year)), [text for text in summary_year['출판일']])
plt.show()

stopwords = set(STOPWORDS)
wc = WordCloud(background_color = 'ivory', stopwords = stopwords, width = 800, height = 600)
cloud = wc.generate_from_frequencies(word_count)
plt.figure(figsize = (8,8))
plt.imshow(cloud)
plt.axis('off')
plt.show
<function matplotlib.pyplot.show(*args, **kw)>

cloud.to_file("riss_bigdata_wordCloud.jpg")
<wordcloud.wordcloud.WordCloud at 0x7f71306177c0>

02 [한글 분석 + 워드클라우드]

한글 뉴스 기사의 키워드 분석하기

!pip install konlpy
Looking in indexes: https://pypi.org/simple, https://us-python.pkg.dev/colab-wheels/public/simple/
Requirement already satisfied: konlpy in /usr/local/lib/python3.8/dist-packages (0.6.0)
Requirement already satisfied: lxml>=4.1.0 in /usr/local/lib/python3.8/dist-packages (from konlpy) (4.9.2)
Requirement already satisfied: JPype1>=0.7.0 in /usr/local/lib/python3.8/dist-packages (from konlpy) (1.4.1)
Requirement already satisfied: numpy>=1.6 in /usr/local/lib/python3.8/dist-packages (from konlpy) (1.21.6)
Requirement already satisfied: packaging in /usr/local/lib/python3.8/dist-packages (from JPype1>=0.7.0->konlpy) (21.3)
Requirement already satisfied: pyparsing!=3.0.5,>=2.0.2 in /usr/local/lib/python3.8/dist-packages (from packaging->JPype1>=0.7.0->konlpy) (3.0.9)

import json
import re
from konlpy.tag import Okt
from collections import Counter
import matplotlib
import matplotlib.pyplot as plt
from matplotlib import font_manager, rc
from wordcloud import WordCloud

inputFileName = 'etnews.kr_facebook_2016-01-01_2018-08-01_4차 산업혁명'
data = json.loads(open(inputFileName+'.json', 'r', encoding = 'utf-8').read())
data    #출력하여 내용 확인
[{'created_time': '2018-06-20 18:06:39',
  'link': 'https://www.facebook.com/etnews.kr/videos/1981346601899735/',
  'message': '6월의 스파크포럼 - "미래 시대, 조직의 변화도 시작됐다!"\n\n스파크포럼은 현 사회의 사회문제 및 이슈를 제기하고, 그 이슈를 혁신적으로 해결하고자 하는 소셜이노베이터를 발굴, 지원하여 우리 사회 따뜻한 변화를 확산시키지 위해 만들어진 도전과 만남의 자리입니다.\n\n6월의 스파크포럼에서는 4차 산업혁명 시대의 기업조직과 조직문화를 살펴보고, 조직의 변화를 받아들이고 실험해나가는 사례를 통해 미래 시대 조직이 나아가야 할 방향을 함께 생각해보고자 합니다.',
  'name': '6월의 스파크포럼 - "미래 시대, 조직의 변화도 시작됐다!"',
  'post_id': '407886705912407_1981346601899735',
  'total_comments': 3},
 {'created_time': '2018-06-14 10:41:16',
  'link': 'http://www.etnews.com/20180612000347',
  'message': '로봇이 4차 산업혁명 주요 성장 동력으로 떠오르면서 국내 로봇 기업에 재평가가 이뤄지고 있다는 분석입니다.',
  'name': '기술력 갖춘 로봇기업 몸값 치솟는다',
  'post_id': '407886705912407_1971252229575839',
  'total_comments': 0},
 {'created_time': '2018-04-10 17:42:00',
  'link': 'http://www.etnews.com/20180409000360?mc=em_002_00001',
  'message': '4차 산업혁명 시대 금융 산업 전반 점검해야 한다는...',
  'name': "휘청거리는 대한민국 '금융'...서민도 기업도 ICT도 실종",
  'post_id': '407886705912407_1900738133293916',
  'total_comments': 0},
 {'created_time': '2018-03-05 12:05:00',
  'link': 'https://goo.gl/qPEyAJ',
  'message': "공통 간판 공약은 '4차 산업혁명 선도 도시 구현'과 '가상화폐·블록체인 기술 활용' 등입니다. https://goo.gl/qPEyAJ",
  'name': "지방선거 D-100 정책 공약 '4차산업혁명'에 초점",
  'post_id': '407886705912407_1859978837369846',
  'total_comments': 0},
 {'created_time': '2018-02-05 17:20:01',
  'link': 'http://www.etnews.com/20180205000237',
  'message': '이번 과정은 4차 산업혁명 핵심 기술인 AI에 대한 통찰력과 알파고를 통해 우리 일상과 익숙해진 로봇, 빅데이터 산업 흐름을 알려줍니다. #인공지능 #최고위과정 #산학연최고전문가 https://goo.gl/jSZSNX',
  'name': '[알림]제2기 인공지능 최고위 과정 모집',
  'post_id': '407886705912407_1829059807128416',
  'total_comments': 0},
 {'created_time': '2018-01-29 23:39:00',
  'link': 'http://www.etnews.com/20180129000403?mc=em_003_00001',
  'message': '4차 산업혁명 모든 기술과 서비스 구현',
  'name': "첫 스마트시티 '부산·세종' 낙점...규제 프리존으로",
  'post_id': '407886705912407_1821177084583355',
  'total_comments': 0},
 {'created_time': '2017-11-09 18:11:43',
  'link': 'http://www.sek.co.kr/2017/nbf',
  'message': '산업혁명의 나라, 영국엔 4차 산업혁명이 없다?\n한국-영국 경험과 지혜를 모으는 ‘미래비즈니스포럼 2017’ 개최',
  'name': '미래비즈니스포럼 2017 : 손에 잡히는 4차 산업혁명',
  'post_id': '407886705912407_1735224763178588',
  'total_comments': 0},
 {'created_time': '2017-10-25 13:08:49',
  'link': 'https://www.facebook.com/etnews.kr/videos/1720053844695680/',
  'message': "[전자신문TV 라이브] SBA 신직업위크\n\n4차 산업혁명 시대의 경쟁력 확보와 일자리 창출의 해법을 신직업에서 찾고 있는 서울산업진흥원(SBA)이 자신들의 역량과 노력을 모두 담은 '제 2회 신직업위크'를 진행하고 있습니다.\n\n이번 전자신문TV라이브는 서울 대치동 SBA 신직업교육센터에서 강만구 신직업교육팀장과 서지윤 신직업리서치팀수석, 윤석원 테스트웍스 대표 등과 함께 '제 2회 신직업위크'와 최신 신직업트렌드를 알아보는 시간으로 마련했습니다. 많은 시청 바랍니다. \n\n#전자신문 #전자신문엔터 #전자신문TV #라이브 #소셜방송 #현장라이브 #손보련 #신직업위크 #서울산업진흥원 #SBA #신직업 #서울 #강만구 #서지윤 #윤석원 #테스트웍스 #소프트웨어 #테스터 #소프트웨어테스터",
  'name': '[전자신문TV 라이브] SBA 신직업위크',
  'post_id': '407886705912407_1720053844695680',
  'total_comments': 0},
 {'created_time': '2017-10-06 11:50:00',
  'link': 'http://www.etnews.com/20171005000056?mc=em_011_00001',
  'message': '혹시 우리나라도?~~4차 산업혁명 시대 떠오르는 일자리 전략으로 강추~~',
  'name': '日, 인공지능 활용능력 자격시험 만든다',
  'post_id': '407886705912407_1700789646622100',
  'total_comments': 1},
 {'created_time': '2017-09-25 15:30:00',
  'link': 'http://www.etnews.com/20170924000106?mc=em_003_00001',
  'message': '4차 산업혁명 대응에 가장 강점 분야는 5세대 이동통신...건강한 산업 생태계 조성 노력 강조',
  'name': "김상조 공정위원장 “기업집단국, 조사국과 달라…'존경받는 기업' 만드는 게 핵심”",
  'post_id': '407886705912407_1691461310888267',
  'total_comments': 0},
 {'created_time': '2017-08-29 00:54:00',
  'link': 'http://www.etnews.com/20170825000162?mc=em_009_00001',
  'message': "중앙부처 공무원 51.3% '4차 산업혁명 관련 가장 시급히 대응해야 할 기술적 과제'로 빅데이터 활용 꼽",
  'name': '현직 공무원들 "빅데이터·AI, 정책에 적극 활용"…4차 산업혁명 인식 높아',
  'post_id': '407886705912407_1665258370175228',
  'total_comments': 1},
 {'created_time': '2017-08-17 15:37:14',
  'link': 'https://www.facebook.com/etnews.kr/videos/1655135431187522/',
  'message': '[전자신문TV 라이브] 직격인터뷰 - 서울산업진흥원(SBA) 주형철 대표이사\n\n최근 각급 공공기관들이 4차 산업혁명기를 맞은 국내 중소기업의 경쟁력과 일자리 창출을 위해 노력하고 있는데요.\n\n이번 전자신문TV 라이브는 중소기업 활성화와 일자리 창출에 앞장서는 공공기관 중 대표적 사례로 꼽히는 서울산업진흥원 주형철 대표이사를 모시고 다양한 이야기를 듣는 시간으로 진행됩니다.\n\n#전자신문 #전자신문TV #서울산업진흥원 #SBA #주형철 #소성렬',
  'name': '[전자신문TV 라이브] 직격인터뷰 - 서울산업진흥원(SBA) 주형철 대표이사',
  'post_id': '407886705912407_1655135431187522',
  'total_comments': 15},
 {'created_time': '2017-08-12 17:48:00',
  'link': 'http://www.etnews.com/20170810000414?mc=em_009_00001',
  'message': '4차 산업혁명 대응은 일자리 창출과 함께 문재인 정부 주요 국정 과제인 데도 우선순위에서 밀렸다는 지적',
  'name': '4차 산업혁명위 출범 후순위로 밀렸다',
  'post_id': '407886705912407_1648865381814527',
  'total_comments': 0},
 {'created_time': '2017-06-09 22:00:00',
  'link': 'http://www.etnews.com/20170608000300?mc=em_009_00001',
  'message': '통합·개혁 행보에서 빠른 움직임을 보였지만 4차 산업혁명 등 신성장동력 창출에서는 한 발짝도 못 나갔다는 평입니다...',
  'name': "새정부 출범 한 달...'통합·개혁'엔 진일보, '4차 산업혁명 대응' 한발짝도 못 나가",
  'post_id': '407886705912407_1579376925430040',
  'total_comments': 4},
 {'created_time': '2017-05-24 18:30:00',
  'link': 'http://www.etnews.com/20170524000281?mc=em_001_00001',
  'message': "미래창조과학부가 주최하고 전자신문사, 한국경제신문, 한국정보통신진흥협회(KAIT) 주관으로 24일 서울 강남구 삼성동 코엑스에서 개막된 '월드IT쇼(WIS) 2017'은 4차 산업혁명이 더 이상 개념이 아니라 현실로 다가왔음을 입증했습니다~",
  'name': '[WIS 2017]4차 산업혁명, 현실과 마주한 날',
  'post_id': '407886705912407_1562807837086949',
  'total_comments': 0},
 {'created_time': '2017-05-17 18:38:04',
  'link': 'https://www.facebook.com/etnews.kr/photos/a.409377469096664/1556319464402453/?type=3',
  'message': '[전자신문TV 라이브 예고]\n이번 전자신문TV는 4차 산업혁명과 과학기술 영상컨퍼런스를 생중계합니다~! 광운대학교 이승현 교수님을 모시고 진행하는 이번 행사는 VR. AR 기술과 몰힙형 미디어에 대해 설명해 주신다고 합니다!\n\n이번 행사는 전자신문 Entertainment와 가상현실 스튜디오 솔루션을 제작한 다림비젼, 중견기업TV와 함께  4차 산업 정보 채널 IBSB(IT Tech. & Business SNS Broadcast)를 통해 진행합니다.\n\nIBSB는 전세계 4차 산업 혁명의 현장의 생생한 IT 소식과 정보, 컨퍼런스의 강연, VOD정보를 VR 기술과 원격 Live방송 기술을 중심으로 시간과 거리를 초월하는 실시간 TELE-Presentation 기술의 새로운 방송 서비스입니다.\n\n이런 유익한 강의를 전자신문 페이스북을 통해서 시청하실 수 있습니다~ 많은 관심부탁드립니다♥',
  'name': 'Timeline Photos',
  'post_id': '407886705912407_1556319464402453',
  'total_comments': 1},
 {'created_time': '2017-04-05 07:00:00',
  'link': 'http://www.etnews.com/20170404000230',
  'message': '대선 국면이지만 법안의 시급성 때문에 4차 산업혁명 대비 법제 개편 논의가 불붙을 전망입니다...',
  'name': "'4차 산업혁명 기본법' 나왔다…대선 기간 법제화 논의 불붙을듯",
  'post_id': '407886705912407_1505222409512159',
  'total_comments': 1},
 {'created_time': '2017-02-14 08:30:01',
  'link': 'http://www.etnews.com/20170213000307',
  'message': '4차 산업혁명 시대 대비!!!',
  'name': '[차기 정부 거버넌스 개편 방향 좌담회]"400조 정부, 혁신부총리가 답이다"',
  'post_id': '407886705912407_1452393468128387',
  'total_comments': 0},
 {'created_time': '2016-01-25 06:00:00',
  'link': 'http://www.etnews.com/20160124000075',
  'message': '‘#4차산업혁명 의 이해(Mastering the Fourth Industrial Revolution)’...\n23일 폐막한 #다보스포럼 에서는  4차 산업혁명 기대와 우려가 교차했는데요. 구글, 애플, 페이스북, 삼성전자, LG전자가 앞다퉈 4차 산업혁명 흐름에 뛰어 들고 있습니다.',
  'name': '[이슈분석]다보스포럼, 4차 산업혁명 기대와 우려 교차',
  'post_id': '407886705912407_1106254916075579',
  'total_comments': 0}]
  
 message = ''
for item in data:
    if 'message' in item.keys():
        message = message + re.sub(r'[^\w]', ' ', item['message']) +''
message #출력하여 내용 확인
6월의 스파크포럼    미래 시대  조직의 변화도 시작됐다    스파크포럼은 현 사회의 사회문제 및 이슈를 제기하고  그 이슈를 혁신적으로 해결하고자 하는 소셜이노베이터를 발굴  지원하여 우리 사회 따뜻한 변화를 확산시키지 위해 만들어진 도전과 만남의 자리입니다   6월의 스파크포럼에서는 4차 산업혁명 시대의 기업조직과 조직문화를 살펴보고  조직의 변화를 받아들이고 실험해나가는 사례를 통해 미래 시대 조직이 나아가야 할 방향을 함께 생각해보고자 합니다 로봇이 4차 산업혁명 주요 성장 동력으로 떠오르면서 국내 로봇 기업에 재평가가 이뤄지고 있다는 분석입니다 4차 산업혁명 시대 금융 산업 전반 점검해야 한다는   공통 간판 공약은  4차 산업혁명 선도 도시 구현 과  가상화폐 블록체인 기술 활용  등입니다  https   goo gl qPEyAJ이번 과정은 4차 산업혁명 핵심 기술인 AI에 대한 통찰력과 알파고를 통해 우리 일상과 익숙해진 로봇  빅데이터 산업 흐름을 알려줍니다   인공지능  최고위과정  산학연최고전문가 https   goo gl jSZSNX4차 산업혁명 모든 기술과 서비스 구현산업혁명의 나라  영국엔 4차 산업혁명이 없다  한국 영국 경험과 지혜를 모으는  미래비즈니스포럼 2017  개최 전자신문TV 라이브  SBA 신직업위크  4차 산업혁명 시대의 경쟁력 확보와 일자리 창출의 해법을 신직업에서 찾고 있는 서울산업진흥원 SBA 이 자신들의 역량과 노력을 모두 담은  제 2회 신직업위크 를 진행하고 있습니다   이번 전자신문TV라이브는 서울 대치동 SBA 신직업교육센터에서 강만구 신직업교육팀장과 서지윤 신직업리서치팀수석  윤석원 테스트웍스 대표 등과 함께  제 2회 신직업위크 와 최신 신직업트렌드를 알아보는 시간으로 마련했습니다  많은 시청 바랍니다     전자신문  전자신문엔터  전자신문TV  라이브  소셜방송  현장라이브  손보련  신직업위크  서울산업진흥원  SBA  신직업  서울  강만구  서지윤  윤석원  테스트웍스  소프트웨어  테스터 

nlp = Okt()
message_N = nlp.nouns(message)
message_N    #출력하여 내용 확인
['스파크',
 '포럼',
 '미래',
 '시대',
 '조직',
 '변화',
 '시작',
 '스파크',
 '포럼',
 '현',
 '사회',
 '사회',
 '문제',
 '및',
 '이슈',
 '제기',
 '그',
 '이슈',
 '혁신',
 '해결',
 '소셜',
 '이노',
 '베이',
 '터',
 '발굴',
 '지원',
 '우리',
 '사회',
 '변화',
 '확산',
 '위해',
 '도전',
 '만남',
 '자리',
 '스파크',
 '포럼',
 '차',
 '산업혁명',
 '시대',
 '기업',
 '조직',
 '직문',
 '조직',
 '변화',
 '실험',
 '사례',
 '통해',
 '미래',
 '시대',
 '조직',
 '방향',
 '생각',
 '로봇',
 '차',
 '산업혁명',
 '주요',
 '성장',
 '동력',
 '국내',
 '로봇',
 '기업',
 '재',
 '평가',
 '분석',
 '차',
 '산업혁명',
 '시대',
 '금융',
 '산업',
 '전반',
 '점검',
 '공통',
 '간판',
 '공약',
 '차',
 '산업혁명',
 '선도',
 '도시',
 '구현',
 '과',
 '가상',
 '화폐',
 '블록',
 '체인',
 '기술',
 '활용',
 '등',
 '이번',
 '과정',
 '차',
 '산업혁명',
 '핵심',
 '기술',
 '대한',
 '통찰',
 '알파',
 '통해',
 '우리',
 '일상',
 '로봇',
 '빅데이터',
 '산업',
 '흐름',
 '인공',
 '지능',
 '최고',
 '위',
 '과정',
 '산학',
 '최고',
 '전문가',
 '차',
 '산업혁명',
 '모든',
 '기술',
 '서비스',
 '산업혁명',
 '나라',
 '영국',
 '차',
 '산업혁명',
 '한국',
 '영국',
 '경험',
 '지혜',
 '미래',
 '비즈니스',
 '포럼',
 '개최',
 '전자신문',
 '라이브',
 '직업',
 '위',
 '차',
 '산업혁명',
 '시대',
 '경쟁력',
 '확보',
 '일자리',
 '창',
 '해법',
 '직업',
 '산업',
 '진흥',
 '이',
 '자신',
 '역량',
 '노력',
 '모두',
 '제',
 '직업',
 '위',
 '를',
 '진행',
 '이번',
 '전자신문',
 '라이브',
 '서울',
 '대치동',
 '직업',
 '교육',
 '센터',
 '강',
 '만구',
 '직업',
 '교육',
 '팀',
 '지윤',
 '직업',
 '리서치',
 '팀',
 '수석',
 '윤석',
 '테스트',
 '웍스',
 '대표',
 '등',
 '제',
 '직업',
 '위',
 '최신',
 '직업',
 '트렌드',
 '시간',
 '마련',
 '시청',
 '전자신문',
 '전자신문',
 '터',
 '전자신문',
 '라이브',
 '소셜',
 '방송',
 '현장',
 '라이브',
 '손',
 '보련',
 '직업',
 '위',
 '산업',
 '진흥',
 '직업',
 '서울',
 '강',
 '만구',
 '지윤',
 '윤석',
 '테스트',
 '웍스',
 '소프트웨어',
 '테스',
 '터',
 '소프트웨어',
 '테스',
 '터',
 '혹시',
 '우리나라',
 '차',
 '산업혁명',
 '시대',
 '일자리',
 '전략',
 '강추',
 '차',
 '산업혁명',
 '대응',
 '가장',
 '강점',
 '분야',
 '세대',
 '이동통신',
 '산업',
 '생태계',
 '조성',
 '노력',
 '강조',
 '중앙',
 '부처',
 '공무원',
 '차',
 '산업혁명',
 '관련',
 '가장',
 '대응',
 '기술',
 '과제',
 '로',
 '빅데이터',
 '활용',
 '꼽',
 '전자신문',
 '라이브',
 '직',
 '격인',
 '터뷰',
 '산업',
 '진흥',
 '주형',
 '철',
 '대표이사',
 '최근',
 '급',
 '공공기관',
 '차',
 '산업혁명',
 '국내',
 '중소기업',
 '경쟁력',
 '일자리',
 '창',
 '위해',
 '노력',
 '이번',
 '전자신문',
 '라이브',
 '중소기업',
 '활성화',
 '일자리',
 '창',
 '공공기관',
 '중',
 '대표',
 '사례',
 '산업',
 '진흥',
 '주형',
 '철',
 '대표이사',
 '모시',
 '이야기',
 '시간',
 '진행',
 '전자신문',
 '전자신문',
 '산업',
 '진흥',
 '주형',
 '철',
 '렬',
 '차',
 '산업혁명',
 '대응',
 '일자리',
 '창',
 '문재인',
 '정부',
 '주요',
 '국정',
 '과제',
 '우선',
 '순위',
 '지적',
 '통합',
 '개혁',
 '행보',
 '움직임',
 '차',
 '산업혁명',
 '등',
 '성장동력',
 '창',
 '발짝',
 '못',
 '평',
 '미래창조과학부',
 '주최',
 '전자',
 '신문사',
 '한국',
 '경제',
 '신문',
 '국정',
 '보통신',
 '진흥',
 '협회',
 '주관',
 '서울',
 '강남구',
 '삼성동',
 '코엑스',
 '개막',
 '월드',
 '쇼',
 '은',
 '차',
 '산업혁명',
 '더',
 '이상',
 '개념',
 '현실',
 '음',
 '입증',
 '전자신문',
 '라이브',
 '예고',
 '이번',
 '전자신문',
 '차',
 '산업혁명',
 '과학기술',
 '영상',
 '컨퍼런스',
 '생중계',
 '광운대',
 '학교',
 '이승현',
 '교수',
 '모시',
 '진행',
 '이번',
 '행사',
 '기술',
 '몰힙형',
 '미디어',
 '대해',
 '설명',
 '이번',
 '행사',
 '전자신문',
 '가상현실',
 '스튜디오',
 '솔루션',
 '제작',
 '다림',
 '비젼',
 '중견',
 '기업',
 '차',
 '산업',
 '정보',
 '채널',
 '를',
 '통해',
 '진행',
 '전세계',
 '차',
 '산업',
 '혁명',
 '현장',
 '소식',
 '정보',
 '컨퍼런스',
 '강연',
 '정보',
 '기술',
 '원격',
 '방송',
 '기술',
 '중심',
 '시간',
 '거리',
 '초월',
 '실시간',
 '기술',
 '방송',
 '서비스',
 '강의',
 '전자신문',
 '페이스북',
 '통해',
 '시청',
 '수',
 '관심',
 '대선',
 '국면',
 '법안',
 '시급',
 '때문',
 '차',
 '산업혁명',
 '대비',
 '법제',
 '개편',
 '논의',
 '불',
 '전망',
 '차',
 '산업혁명',
 '시대',
 '대비',
 '차',
 '산업혁명',
 '의',
 '이해',
 '폐막',
 '다보스',
 '포럼',
 '차',
 '산업혁명',
 '기대',
 '우려',
 '교차',
 '구글',
 '애플',
 '페이스북',
 '삼성',
 '전자',
 '전자',
 '앞',
 '차',
 '산업혁명',
 '흐름']
 
count = Counter(message_N)
count   #출력하여 내용 확인
Counter({'스파크': 3,
         '포럼': 5,
         '미래': 3,
         '시대': 7,
         '조직': 4,
         '변화': 3,
         '시작': 1,
         '현': 1,
         '사회': 3,
         '문제': 1,
         '및': 1,
         '이슈': 2,
         '제기': 1,
         '그': 1,
         '혁신': 1,
         '해결': 1,
         '소셜': 2,
         '이노': 1,
         '베이': 1,
         '터': 4,
         '발굴': 1,
         '지원': 1,
         '우리': 2,
         '확산': 1,
         '위해': 2,
         '도전': 1,
         '만남': 1,
         '자리': 1,
         '차': 23,
         '산업혁명': 22,
         '기업': 3,
         '직문': 1,
         '실험': 1,
         '사례': 2,
         '통해': 4,
         '방향': 1,
         '생각': 1,
         '로봇': 3,
         '주요': 2,
         '성장': 1,
         '동력': 1,
         '국내': 2,
         '재': 1,
         '평가': 1,
         '분석': 1,
         '금융': 1,
         '산업': 10,
         '전반': 1,
         '점검': 1,
         '공통': 1,
         '간판': 1,
         '공약': 1,
         '선도': 1,
         '도시': 1,
         '구현': 1,
         '과': 1,
         '가상': 1,
         '화폐': 1,
         '블록': 1,
         '체인': 1,
         '기술': 8,
         '활용': 2,
         '등': 3,
         '이번': 6,
         '과정': 2,
         '핵심': 1,
         '대한': 1,
         '통찰': 1,
         '알파': 1,
         '일상': 1,
         '빅데이터': 2,
         '흐름': 2,
         '인공': 1,
         '지능': 1,
         '최고': 2,
         '위': 5,
         '산학': 1,
         '전문가': 1,
         '모든': 1,
         '서비스': 2,
         '나라': 1,
         '영국': 2,
         '한국': 2,
         '경험': 1,
         '지혜': 1,
         '비즈니스': 1,
         '개최': 1,
         '전자신문': 13,
         '라이브': 7,
         '직업': 10,
         '경쟁력': 2,
         '확보': 1,
         '일자리': 5,
         '창': 5,
         '해법': 1,
         '진흥': 6,
         '이': 1,
         '자신': 1,
         '역량': 1,
         '노력': 3,
         '모두': 1,
         '제': 2,
         '를': 2,
         '진행': 4,
         '서울': 3,
         '대치동': 1,
         '교육': 2,
         '센터': 1,
         '강': 2,
         '만구': 2,
         '팀': 2,
         '지윤': 2,
         '리서치': 1,
         '수석': 1,
         '윤석': 2,
         '테스트': 2,
         '웍스': 2,
         '대표': 2,
         '최신': 1,
         '트렌드': 1,
         '시간': 3,
         '마련': 1,
         '시청': 2,
         '방송': 3,
         '현장': 2,
         '손': 1,
         '보련': 1,
         '소프트웨어': 2,
         '테스': 2,
         '혹시': 1,
         '우리나라': 1,
         '전략': 1,
         '강추': 1,
         '대응': 3,
         '가장': 2,
         '강점': 1,
         '분야': 1,
         '세대': 1,
         '이동통신': 1,
         '생태계': 1,
         '조성': 1,
         '강조': 1,
         '중앙': 1,
         '부처': 1,
         '공무원': 1,
         '관련': 1,
         '과제': 2,
         '로': 1,
         '꼽': 1,
         '직': 1,
         '격인': 1,
         '터뷰': 1,
         '주형': 3,
         '철': 3,
         '대표이사': 2,
         '최근': 1,
         '급': 1,
         '공공기관': 2,
         '중소기업': 2,
         '활성화': 1,
         '중': 1,
         '모시': 2,
         '이야기': 1,
         '렬': 1,
         '문재인': 1,
         '정부': 1,
         '국정': 2,
         '우선': 1,
         '순위': 1,
         '지적': 1,
         '통합': 1,
         '개혁': 1,
         '행보': 1,
         '움직임': 1,
         '성장동력': 1,
         '발짝': 1,
         '못': 1,
         '평': 1,
         '미래창조과학부': 1,
         '주최': 1,
         '전자': 3,
         '신문사': 1,
         '경제': 1,
         '신문': 1,
         '보통신': 1,
         '협회': 1,
         '주관': 1,
         '강남구': 1,
         '삼성동': 1,
         '코엑스': 1,
         '개막': 1,
         '월드': 1,
         '쇼': 1,
         '은': 1,
         '더': 1,
         '이상': 1,
         '개념': 1,
         '현실': 1,
         '음': 1,
         '입증': 1,
         '예고': 1,
         '과학기술': 1,
         '영상': 1,
         '컨퍼런스': 2,
         '생중계': 1,
         '광운대': 1,
         '학교': 1,
         '이승현': 1,
         '교수': 1,
         '행사': 2,
         '몰힙형': 1,
         '미디어': 1,
         '대해': 1,
         '설명': 1,
         '가상현실': 1,
         '스튜디오': 1,
         '솔루션': 1,
         '제작': 1,
         '다림': 1,
         '비젼': 1,
         '중견': 1,
         '정보': 3,
         '채널': 1,
         '전세계': 1,
         '혁명': 1,
         '소식': 1,
         '강연': 1,
         '원격': 1,
         '중심': 1,
         '거리': 1,
         '초월': 1,
         '실시간': 1,
         '강의': 1,
         '페이스북': 2,
         '수': 1,
         '관심': 1,
         '대선': 1,
         '국면': 1,
         '법안': 1,
         '시급': 1,
         '때문': 1,
         '대비': 2,
         '법제': 1,
         '개편': 1,
         '논의': 1,
         '불': 1,
         '전망': 1,
         '의': 1,
         '이해': 1,
         '폐막': 1,
         '다보스': 1,
         '기대': 1,
         '우려': 1,
         '교차': 1,
         '구글': 1,
         '애플': 1,
         '삼성': 1,
         '앞': 1})
         
word_count = dict()
for tag, counts in count.most_common(80):
    if(len(str(tag))>1):
        word_count[tag] = counts
        print("%s : %d" % (tag, counts))
산업혁명 : 22
전자신문 : 13
산업 : 10
직업 : 10
기술 : 8
시대 : 7
라이브 : 7
이번 : 6
진흥 : 6
포럼 : 5
일자리 : 5
조직 : 4
통해 : 4
진행 : 4
스파크 : 3
미래 : 3
변화 : 3
사회 : 3
기업 : 3
로봇 : 3
노력 : 3
서울 : 3
시간 : 3
방송 : 3
대응 : 3
주형 : 3
전자 : 3
정보 : 3
이슈 : 2
소셜 : 2
우리 : 2
위해 : 2
사례 : 2
주요 : 2
국내 : 2
활용 : 2
과정 : 2
빅데이터 : 2
흐름 : 2
최고 : 2
서비스 : 2
영국 : 2
한국 : 2
경쟁력 : 2
교육 : 2
만구 : 2
지윤 : 2
윤석 : 2
테스트 : 2
웍스 : 2
대표 : 2
시청 : 2
현장 : 2
소프트웨어 : 2
테스 : 2
가장 : 2
과제 : 2
대표이사 : 2
공공기관 : 2
중소기업 : 2
모시 : 2
국정 : 2
컨퍼런스 : 2
행사 : 2
페이스북 : 2
대비 : 2
시작 : 1
문제 : 1

!sudo apt-get install -y fonts-nanum
!sudo fc-cache -fv
!rm ~/.cache/matplotlib -rf
Reading package lists... Done
Building dependency tree       
Reading state information... Done
fonts-nanum is already the newest version (20170925-1).
The following package was automatically installed and is no longer required:
  libnvidia-common-460
Use 'sudo apt autoremove' to remove it.
0 upgraded, 0 newly installed, 0 to remove and 21 not upgraded.
/usr/share/fonts: caching, new cache contents: 0 fonts, 1 dirs
/usr/share/fonts/truetype: caching, new cache contents: 0 fonts, 3 dirs
/usr/share/fonts/truetype/humor-sans: caching, new cache contents: 1 fonts, 0 dirs
/usr/share/fonts/truetype/liberation: caching, new cache contents: 16 fonts, 0 dirs
/usr/share/fonts/truetype/nanum: caching, new cache contents: 10 fonts, 0 dirs
/usr/local/share/fonts: caching, new cache contents: 0 fonts, 0 dirs
/root/.local/share/fonts: skipping, no such directory
/root/.fonts: skipping, no such directory
/var/cache/fontconfig: cleaning cache directory
/root/.cache/fontconfig: not cleaning non-existent cache directory
/root/.fontconfig: not cleaning non-existent cache directory
fc-cache: succeeded

plt.rc('font', family ='NanumBarunGothic')

plt.figure(figsize = (12, 5))
plt.xlabel('키워드')
plt.ylabel('빈도수')
plt.grid(True)
sorted_Keys = sorted(word_count, key = word_count.get, reverse = True)
sorted_Values = sorted(word_count.values(), reverse = True)
plt.bar(range(len(word_count)), sorted_Values, align = 'center')
plt.xticks(range(len(word_count)), list(sorted_Keys), rotation = '75')
plt.show()

wc = WordCloud('NanumBarunGothic', background_color = 'ivory', width = 800, height = 600)
cloud = wc.generate_from_frequencies(word_count)
plt.figure(figsize = (8, 8))
plt.imshow(cloud)
plt.axis('off')
plt.show
<function matplotlib.pyplot.show(*args, **kw)>