Uploaded by Dhinu Anoop

Big Data in Smart Cities Literature Review

advertisement
1
Big Data in Smart Cities Literature Review
Anoop Parappully
Data Science & Big Data Analy (ITS-836-M30) - Full Term
University of the Cumberlands
Professor Dr. Donald Walker
September 18, 2022
2
1. INTRODUCTION
Inhabitants' evolution, urbanization, and weather difference are some of the recent most
influential challenges. Lately, the vision of a smart city, which encloses innovative well-being,
savvy transit, and a competitive society, has drawn much awareness due to its effect on individuals'
differentia of possession. Deploying big data to metropolia can resolve the community's numerous
concerns. Hence, the evolution of big data and the Internet of Things (IoT) has recreated an integral
part in the feasibility of innovative city endeavors. Big data presents the prospect for metropolia
to acquire beneficial understandings from a considerable quantity of data accumulated on different
bases. The IoT permits merging sensors, radiofrequency recognition, and Bluetooth in the realworld domain by employing favorably networked services. The assortment of the IoT and big data
is a new research zone that has obtained reinvigorated and exhilarating challenges for acquiring
the objective of prospective smart cities.
This research paper illustrates a methodical review of big data handling techniques in
smart cities. I investigate study measures issued between 2013 and 2021, where these methods
are classified on their algorithms and architectures. The central notions, evaluation strategies,
mechanisms, metrics, algorithm classifications, edges, and weaknesses are analyzed. Besides, critical evaluation elements are presented in which scalability and availability by 16%, duration by
15%, and precision by 11% are more in direction. Ultimately, some of the challenges, available
problems, and future trends worth additional examination are proposed in big data handling procedures in smart cities.
In this research paper, I also depict the current transmission technologies and intelligent
applications operated within the background of smart cities. The concepts of big data analytics to
sustain smart cities are examined by concentrating on how big data can transform urban inhabitants
3
of various statuses. Moreover, a prospective enterprise standard that can handle big data for smart
cities is suggested, and the enterprise and technical analysis challenges are recognized. This reflection can act as a model for students and enterprises for the future advancement and evolution
of smart cities in the context of big data.
2. BACKGROUND
The evolution of big data analysis can be advantageous in smart city transformations. Big data
typically offers enormous and complex details that depict individuals’ careers and is definable in
terms of dimensions, research, processes, or consequences on businesses. Cities in the universe
collect much data applicable to city life from items and stakeholders. The background section
summarizes the introductory paradigms in smart cities and big data analytics and their relationships. The following are the subsections as part of the background.
 Describe the issue: Deals with smart cities
 Discuss the problem: Big data in smart cities
 Techniques
 Algorithms Architectures
(Ahmed M. Shahat Osman & Ahmed Elragal)
4
2.1 DESCRIBE THE ISSUE: DEALS WITH SMART CITIES
Big data analytics is stabilized to alter the way of individuals in their live and work. Enterprises
currently bear to overview the purpose and design unknown patterns to fit big data science. However, the notion of big data analytics has evolved so prevalently in e-commerce that there are rare
kinds of studies in scientific congregations. Big data analysis has evolved into a compelling approach in smart cities. Smart city innovations are generally available information attainable in
several circumstances, like solitude, power, healthcare, smart homes, and transportation.
The concept of a smart city itself incorporates a combination of upright territories. Experimenters
offer different application zones around smart cities acquired from a significant number of recent
examinations, including smart devices, atmosphere, smart home, power consumption, transportation, stores, agriculture, protection, well-being, hospitality, and smart practicum.
The communication evolution of bearable, vitality in the metropolis, and data and transmission technology has recently joined beneath the designation of the smart tolerable city.
Consequently, the smart city has been designated an emergent technology for addressing metropolitan affairs. A smart city is a subset of a more comprehensive discussion of big data analysis.
Big data are an enormous quantity of terabytes or more data forged through and concerning
individuals, entities, and activities and responses among them in smart cities. The growth of smart
cities is an excellent model of the infectious and employment of big data analysis. The term
smart city guides to a wide range of paradigms; nevertheless, it frequently directs to digital
techniques to develop data to improve the municipality’s yield, promote citizens’ fitness and heal
residents’ viability. Typically, the practical completion of assistance is not restricted by techno-
5
logical problems but by the absence of a widely acknowledged transmission and service architecture that can be summarized from the distinctive characteristics of exceptional technology and
deliver a coordinated admission to the services. These assistances are as follows:
 Power management: Metropolitan big data analytics can supply users with a power consumption tracking system for the entire metropolis that offers control and residents a transparent picture of the mandated energy for different services. Therefore, recognizing introductory power consumption origins will be attainable, and it will be effortless to establish
preferences to optimize their conduct. It runs in sequence with the European order for fuel
efficiency progress in years to arrive.
 Air quality: Renewable Energy Directive was embraced by the European Union with the
objective of environment modification lessening for the previous decade. A 20% reduction
in power consumption by enhancing fuel efficiency by 2020, a 20% drop in conservatory
gas emissions by 2020 approximated with 1990 groups, and a 20% incline in engulfing
renewable power by 2020 are the plans established.
 Noise monitoring: Noise can be regarded as a state of aural corrosion of air. City leaders
have developed distinct regulations to decrease the quantity of noise in city centers at typical hours of noise exhibit. In the locations assuming the assistance, metropolitan big data
analytics may offer a noise tracking service to estimate the quantity of noise made. By
fixing a space-time map of the din corrosion in the community, we can implement general
protection via voice detection algorithms that can determine the breaking noise of glass,
bowls, etc. In this city, the ease at nighttime and the trust of public building proprietors can
be enhanced; regardless, the induction of sound sensors or environmental microphones is
somewhat contentious because of individuals’ privacy.
6
 Waste management: In numerous contemporary municipalities, trash control is essential
because of service expenditures and trash yard depository crises. The infiltration of ICT
solutions in this field might guide to significant financial, holding, and ecological benefits.
 Traffic management: Except for air quality and noise tracks, big data analytics can scrutinize the municipality's traffic bottleneck. Despite the obtainability of camera-based traffic
tracking systems in numerous cities, a more rampant data reference can be delivered by a
low-power across-the-board contact. The GPS established on current automobiles can observe traffic and embrace a variety of air rates and acoustic detectors on a path.
 Smart parking: Founded on street detectors and intelligent shows, smart parking assistance requires automobiles to the adequate parking zones in the metropolis. The advantages
of this assistance are numerous: The quicker it discovers a parking space, the smaller the
carbon emission, the more nominal traffic congestion, and the more satisfied residents will
be. This service can be incorporated into metropolitan big data analytics.
2.2 DISCUSS THE PROBLEM: BIG DATA IN SMART CITIES
There appears to be a standard version of big data analysis when individuals operate it. However, illuminations change across fields, ranges, and scientists. There are several famous specifications on five traditional V-attributes of big data, which are offered. Big data are a tremendous
quantity of information developed through and close people, entities, motions, and replies among
them. They are also immense to conform to Excel; they originated in real-time. Developed cities
have major constellations of computers and systems, a continuous instability of data transmutation,
data interaction, and science sharing, and extreme conduct in the smart community development
of a city in its significant day-to-day proceedings.
7
Digital systems have extensively modified current towns; these methods are evolving intelligent information implements and data commerce lattice. Networked municipalities include residents, transportation means, shipments, information, and witting. Data delivered via various aids
like the Internet, phone commerce, retail commerce, public media, and detectors do different industrial and administration exercises. Big data analytics is the primary source for the development
of developing smart cities. It is even the strategy a government deploys in its estimating infrastructure to explore and accredit overweight scopes and velocity of data to produce worthiness.
There are combinations of devices and notions linked to the big data, some of which are concerned. One of the most significant visions in the big data content is MapReduce. MapReduce is
an uncomplicated programming prototype for developing disseminated data in diverse models in
cloud-based domains. There are three primary factors in MapReduce comprehended as Master,
Map process, and reduce. MapReduce has a workflow where each job creates two user-specified
actions: The first is named Map, and the two Reduce. It can substantially support the execution
of programs in similitude systems.
Hadoop is an in-demand execution of MapReduce that profits from its open-source features.
Undoubtedly, Hadoop spreads information between blocks and separates these split data into
various nodes of a set. This mechanism allows the programs to be conducted on large clusters of
appropriate hardware. In accumulation, Spark is a high-speed, interactive, and clustering estimating device, causing it a proper technology for stream processing. Beneath, we present
Storm, Hive, Flume, and YARN means. Storm manages a tremendous deal of data via realtime loss forbearance research. Mainly, Storm is employed for massive data. Hive is operated
for keeping data conducted in Hadoop; it is also accountable for data governance. Hive provides a
data request interface SQL-based. Flume is appropriate for collecting huge information like
8
incidents and log files. By using Flume, it is likely to move up details on numerous servers in real
time.
2.3 TECHNIQUES
2.4 ALGORITHMS
The algorithm-based techniques for big data analysis in smart cities are examined, and the
taxonomy of pertinent publications is presented and then summarized in the below Section. Algorithm-based systems are contained and offered for big data analysis in smart cities like primary
benefits, weaknesses, central concepts, algorithm sorts, evaluation methods, devices, and case
analyses detail the metrics employed in the literature, including price, availability, sustainability, duration, precision, scalability, healthiness, safety, throughput, feasibility, and protection that is based on the algorithm. This procedure consists of three domains: Heuristic algorithms, meta-heuristic, and no heuristic algorithms.
In Phase 1, the investigation procedure is conducted by involving an electronic search on
Google Scholar as the predominant search engine founded on recognized educational pulisers
like IEEE2, Sage3, ACM4, Wiley5, Emerald6, Springer7, ScienceDirect8,and Talor & Francis9. The following examination system was selected by counting different spellings of actual ingredients to find corresponding analyses. We discovered 725 reports out of other classes connected
to the research subject.
(“Big data” OR Hadoop OR Spark OR Storm) AND (“smart city” OR “smart cities” OR
urban)
In Phase 2 and conforming to the standards of Table 2, we discovered these articles that stood
disseminated from 2013 to February 2021. Due to the broad spectrum of publications on big data
9
and smart cities and despite good discussion papers, just serial source information-indexed documents had been assumed. For contingency reasons, non-English articles, textbook branches, and
examination reports were removed. Thus, 119 articles were selected. Figure 2 shows an outline of
the executed process in this analysis.
In Phase 3, inappropriate articles were withdrawn founded on complete textbook. As a consequence, 46 articles were chosen.
Inclusion and exclusion criteria
Inclusion criteria
1 Analysis articles that current affairs, answers, or evaluation of big data and smart cities
2 Examinations are issued between 2013 to February
2021
3 JCR-indexed articles linked to smart cities and big
data
4
Exclusion criteria
Articles that are text branches, critiques, and contemplations
Documents that cannot be discovered
in a rich textbook or smaller than six
runners
Essays not documented in English
The investigation that are not connected to the examination inquiries
Fig. 2. Filtering procedures of paper selection
Summary of algorithm-based approaches
10
In these analyses, a plan, a system, or an algorithm is shown to crack issues. A synopsis of
algorithm-based systems is given in this section. In this area, the outline of algorithm-based approaches is included and offered for big data analysis in smart cities like primary benefits, weaknesses, main concepts, algorithm classes, evaluation strategies, devices, and subject examinations.
The metrics employed in the publications, including expenditure, availability, duration, scalability,
soundness, safety, accuracy, throughput, feasibility, sustainability, and security, are established on
the algorithm.
(Comparing the metrics used in the literature of algorithm-based approaches)
11
2.5 ARCHITECTURES
In this first, the architecture-based techniques for big data research in smart cities are examined, and anatomy of pertinent publications is presented and then summarized. The articles in this
area suggest an architecture to unravel problems and challenges. Then a synopsis of architecture
established systems is given. Summary of the preferred architecture-based systems Tang introduced a hierarchically split fog computing architecture to support the accumulation of a significant
digit of foundational components and assistance in smart cities. To keep the futurity of all residents, it is essential to totalize the mind in the fog calculating architecture (e.g., to take outdata
performance and feature derivation, to identify unique and difficult issues, and to offer optimum
solutions). The writers instructed the psyche at the edge of a layered haze computing yield. The
calculation tangles at per substrate performed prototype 4-class fog-based analysis models to show
the efficiency and potential of the standard in the entire municipality. They afterward introduced
the “smartness” of the village substructure. This article did not utilize advanced ML algorithms in
the whole design.
Khan et al. presented an architecture to process big data by involving the Hadoop framework
beside Spark and GraphX. The writer showed a three-degree architecture to manage data virtually
from detectors bound to extra supplements in the streets of smart cities in the data group class.
Managing huge data and delivering it to the Hadoop framework is achieved by utilizing a unique
architecture from a hierarchical actuator concept. The timing of the Hadoop design is such that it
negates the gear in the design. The determination module is further established on a separate entry
on data that serve other conclusions on the data according to the different sill. Finally, the proposed
technique is discussed in more data from trustworthy help to set its efficiency on difficulty. As
12
groups and the processing period expansion, the strategy's persuasion also improves. The drawback of this article is that, with identical parts on extra lanes, comparable effects were not accepted.
3. RESEARCH QUESTIONS
In this analysis, we used two techniques. First, we systematically reviewed BDA frameworks
within the context of SCs. The list of nominee compositions is then diagnosed according to the
managed SC fields. This effort seeks to explore the recently posted articles presenting BDA frameworks in the context of SCs. This survey allows exploring the investigation approaches in these
two areas together. During the identical period, it helps determine the typical traits of domainindependent BDA frameworks and compares them in the SC context. This stage ends with selecting a domain-independent that sustains decision-making with data exchangeability. In the two
steps, a prototype for the regular BDA framework is instantiated utilizing industry-known software
containers for practical evaluation. To keep the ratio between technological and organizational
dominance in designing and evaluating the framework prototype, we abide by the action design
research (ADR) approach proposed. ADR consists of four stages, each anchored to seven principles.
By examining broadcast analyses, students can comprehend the challenges and concerns of the
big data operating systems in smart cities and present unknown thoughts. This article tries to answer six investigation queries (RQs) founded on the dreams and environment of the proposed investigation. The queries are as observed:
 RQ1: Which effective data handling practices are involved in smart cities?
 RQ2: What evaluation elements are employed in big data operating systems in smart cities?
 RQ3: What evaluation methods are employed to assess the comprehensive data-taking systems in smart cities?
13
 RQ4: What famous evaluation conditions and modeling agencies are utilized in big datataking systems in smart cities?
 RQ5: Which algorithms are employed in carrying big data practices in smart cities?
 RQ6: What challenges, available problems, and future trends are placed in big data-taking
systems in smart cities to view the following direction?
The main advantages and disadvantages of algorithm-based and architecture-based
approaches
Advantages
Disadvantages
Algorithm-based
 Better accuracy
 Better scalability
 Better availability
 Better cost
 Unsatisfactory complexity
 Unsatisfactory security
 Unsatisfactory safety
Architecture-based
 Better availability
 Better scalability
 Better execution time




Unsatisfactory security
Unsatisfactory safety
Unsatisfactory feasibility
Unsatisfactory accuracy
4. METHODOLOGY
This unit offers an SLR process to do a large-ranging and moral analysis of big data operating
procedures in smart cities. The SLR provides a basis or framework for good positioning for new
analysis strategies. These assessment practices require more industry than expected reviews, and
the SLR assessment techniques can require quantity-based model research. We investigated this
from respected periodicals such as Springer, IEEE, ScienceDirect, etc. This area contains query
building. The essential step of accomplishing an SLR is thoroughly studying the selected papers.
RQ2: What evaluation factors are applied to big data handling approaches in smart cities?
14
According to the Tables and to answer RQ2, the evaluation elements are determined. The statistic percent indicates the term action with 16%, the scalability with 15%, and precision with 12%.
Describes the evaluation elements in the books as a model. As defined in the said formation, in the
algorithm-based documents, the precision and scalability characteristics, and in the architecturebased documents, period, availability, and scalability are numerous regular.
(Percentage of evaluation factors of big data in the smart city in the literature)
15
(Repetition of evaluation factors in each category)
5. DATA ANALYSIS
The development of big data analysis can be helpful in smart city transformations. Big data
typically offers enormous and detailed information that shows individuals' careers and is definable
in scope, research techniques, or consequences on corporations. Cities in the universe collect much
data applicable to city life from items and stakeholders. Thus, the Seoul form identified midnight
metropolis representatives and ordered city buses to use, facilitating that midnight public assistance. In accumulation, the San Francisco circumstances discovered that investigating illegal dossiers improved public safety assistance. The Rio de Janeiro form utilized data from illustrations,
videotapes, and scribes to decrypt city issues like temperature, energy conservation, and protection. So did Santander of Spaniard and Cosenza city of Italy. Different investigators have examined
the application of big data analysis in smart cities and set related problems in extraordinary circumstances like conveyance, public safety, and sustainability. Therefore, big data analysis can be
instrumental and positively functional in smart cities. Not fighting the consequence of a smart city
16
in big data tolerating systems, there are rare periodic considerations regarding handling practices
of big data in smart cities. Hence, this article examines the current investigations in big data analytics for smart cities simply and systematically.
The immediate assistance of this article is as follows:
 The immediate assistance of this article is as follows:
 Presenting a systematic literature review (SLR) for big data handling systems in smart
cities and critical triumphs in this field.
 Proposing a variety of articles established on algorithms and architecture looms for
big data analysis in smart cities
 This analysis examines mechanisms, central concepts, evaluation methods, algorithm
classes, subject breakdowns, and benefits and drawbacks.
 Suggesting a comparison of significant issues of big data driving systems in smart cities.
 Exploring an abstract of contemporary challenges, prospective analysis, available
problems, and the efficient function significant data tolerating techniques can have in
smart cities.
17
(Approach types applied to big data analysis in smart cities)
6. CONCLUSION
To conclude, this research was founded on the SLR process for enduring systems to big
data in smart cities. In the research procedure, we chose 46 articles from 725 writings from our
investigation question between the years 2013 to February 2021. According to this SLR, the most
journal of manuscripts was in 2018. Most of them were connected to ScienceDirect and IEEE
publishers and the most insignificant to Wiley and Taylor & Francis. The assigned articles were
categorized into two major parties, including algorithm-based and architecture-based approaches.
The algorithm-based process finds the numerous printed pieces 57% and architecture-based by
43% forms. Also, the issue of algorithm-based was metaheuristic, heuristic, and non-heuristic. The
meta-heuristic had the most elevated digit of records, with 38%. In this course, 51% of the investigation reports utilized actual testbeds to appraise the proposed researches. In addition, 24% of
18
researchers used simulation. In the literature reviewed, Spark, Hadoop, and MATLAB had the
highest and most across-the-board use of big data-related devices.
The statistic percent of the evaluation elements revealed that the moment, scalability, availability, and precision had been employed most in this investigation. Similarly, we explained the
challenges, available problems, and prospective results in particular in the domain. The present
research is not complimentary of regulations. Firstly, non-English articles, textbook branches, federal periodicals, discussion forms, and documents that contained trimmer than six pages were
banned from our analysis. Secondly, this article was created on six queries, but there could be
additional inquiries in this respect. Thirdly, our taxonomy was algorithm-based and architecturebased, while other experimenters could offer different kinds of taxonomy. Fourthly, due to the
broad spectrum of big data and smart cities publications, only JCR-indexed journal papers had
been considered despite other reputed conference papers. Further, according to the databases we
utilized, some related reports might not have been evaluated via article discovery procedures.
19
REFERENCES:
(PDF) incorporating intelligence in fog computing for ... - researchgate. (n.d.). Retrieved September 18, 2022, from https://www.researchgate.net/publication/313758226_Incorporating_Intelligence_in_Fog_Computing_for_Big_Data_Analysis_in_Smart_Cities
Bibri, S. E., & Krogstie, J. (1970, January 1). [PDF] Smart Sustainable Cities of the Future: An
extensive interdisciplinary literature review: Semantic scholar. undefined. Retrieved September 18, 2022, from https://www.semanticscholar.org/paper/Smart-sustainable-cities-ofthe-future%3A-An-review-Bibri-Krogstie/1059753ba698c6fe5e30952bcb2714e0c205b742
Bui, N. (2022, September 5). Internet of things for smart cities. IEEE Internet of Things Journal.
Retrieved September 18, 2022, from https://www.academia.edu/17042316/Internet_of_Things_for_Smart_Cities
Cicirelli, F., Guerrieri, A., Spezzano, G., & Vinci, A. (1970, January 1). An edge-based platform
for Dynamic Smart City Applications: Semantic scholar. undefined. Retrieved September
18, 2022, from https://www.semanticscholar.org/paper/An-edge-based-platform-for-dynamic-Smart-City-Cicirelli-Guerrieri/223c06748fa33b2c28a714be430f85870ac79c11
Current trends in Smart City Initiatives: Some stylised facts. (n.d.). Retrieved September 18, 2022,
from
https://www.researchgate.net/profile/Paolo-Neirotti/publication/260015335_Current_trends_in_Smart_City_initiatives_Some_stylised_facts/links/5ba46d3792851ca9ed1a1c18/Current-trends-in-Smart-City-initiativesSome-stylised-facts.pdf
Haque, H., Zulfiqar, H., Ahmed, A., & Ali, Y. (1970, January 1). A context‐aware framework for
modelling and verification of Smart Parking Systems in urban cities: Semantic scholar. undefined. Retrieved September 18, 2022, from https://www.semanticscholar.org/paper/Acontext%E2%80%90aware-framework-for-modelling-and-of-in-HaqueZulfiqar/e52dd64ad77b8541b3b74bc61f6ddb0c2ba026c1
Ju, J., Liu, L., & Feng, Y. (1970, January 1). [PDF] citizen-centered big data analysis-driven
governance intelligence framework for smart cities: Semantic scholar. undefined. Retrieved
September 18, 2022, from https://www.semanticscholar.org/paper/Citizen-centered-bigdata-analysis-driven-framework-Ju-Liu/57ca899947d9c9cc295cee4f93c4c104ffa174a1
Kashani, M. H., Madanipour, M., Nikravan, M., Asghari, P., & Mahdipour, E. (1970, January 1).
A systematic review of IOT in Healthcare: Applications, techniques, and trends: Semantic
scholar. undefined. Retrieved September 18, 2022, from https://www.semanticscholar.org/paper/A-systematic-review-of-IoT-in-healthcare%3A-and-KashaniMadanipour/6c45a4d57be64d6c986751fc4969569f7dd9d6df
20
Kitchin, R. (2013, November 29). The real-time city? big data and smart urbanism - geojournal.
SpringerLink. Retrieved September 18, 2022, from https://link.springer.com/article/10.1007/s10708-013-9516-8
Lazarova-Molnar, S., & Mohamed, N. (2017, November 15). Collaborative data analytics for
smart buildings: Opportunities and models - cluster computing. SpringerLink. Retrieved
September 18, 2022, from https://link.springer.com/article/10.1007/s10586-017-1362-x
Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., & Byers, A. H. (2020,
February 13). Big Data: The Next Frontier for Innovation, competition, and Productivity.
McKinsey & Company. Retrieved September 18, 2022, from https://www.mckinsey.com/business-functions/mckinsey-digital/our-insights/big-data-the-next-frontier-for-innovation
Rahimi, M., Songhorabadi, M., & Kashani, M. H. (1970, January 1). Fog-based Smart Homes: A
systematic review: Semantic scholar. undefined. Retrieved September 18, 2022, from
https://www.semanticscholar.org/paper/Fog-based-smart-homes%3A-A-systematic-review-Rahimi-Songhorabadi/ddd5a24d3f656db178fe7433840cc4805209c8f7
Sci-Hub | Business Model Analysis of public services operating in the ... (n.d.). Retrieved September 18, 2022, from https://sci-hub.se/10.1016/j.future.2017.01.032
Song, H., Srinivasan, R., Sookoor, T., & Jeschke, S. (2017). Smart cities: Foundations, principles,
and applications. Amazon. Retrieved September 18, 2022, from https://www.amazon.com/Smart-Cities-Foundations-Principles-Applications/dp/1119226392
Wang, X., White, L., & Chen, X. (2015, October 19). Big Data Research for the knowledge economy: Past, present, and future. Industrial Management & Data Systems. Retrieved September 18, 2022, from https://www.emerald.com/insight/content/doi/10.1108/IMDS-09-20150388/full/html
Related documents
Download