{"id":87,"date":"2017-11-03T19:57:59","date_gmt":"2017-11-03T18:57:59","guid":{"rendered":"http:\/\/wchmurze.cloud\/?p=87"},"modified":"2019-08-18T15:10:07","modified_gmt":"2019-08-18T13:10:07","slug":"apache-spark-pierwsze-poczatki","status":"publish","type":"post","link":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/","title":{"rendered":"Apache Spark &#8211; pierwsze pocz\u0105tki"},"content":{"rendered":"<h2>Poradnik:<\/h2>\n<h3>Jak uruchomi\u0107 na komputerze z MS Windows lokaln\u0105 instancj\u0119 Apache Sparka pracuj\u0105c\u0105 na jednym nodzie z gotow\u0105 obs\u0142ug\u0105 skrypt\u00f3w w Pythonie (pyspark) i przyk\u0142adowymi zbiorami danych.<\/h3>\n<p>Na podstawie materia\u0142\u00f3w ze strony<br \/>\n<a href=\"http:\/\/semantica.cs.lth.se\/pyspark\/#\/pyspark-vm\" target=\"_blank\" rel=\"noopener noreferrer\">http:\/\/semantica.cs.lth.se\/pyspark\/#\/pyspark-vm<\/a><\/p>\n<p>Dla tych, kt\u00f3rzy na co dzie\u0144 korzystaj\u0105 ze stacji roboczej MS Windows.<\/p>\n<p>Potrzebne oprogramowanie<br \/>\na) Virtual Box ( wirtualizator)<br \/>\n<a href=\"https:\/\/www.virtualbox.org\/wiki\/Downloads\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/www.virtualbox.org\/wiki\/Downloads<\/a><br \/>\nInstalujemy najwy\u017csz\u0105 wersj\u0119 , jak\u0105 aktualnie obs\u0142uguje Vagrant<br \/>\nGUI VirtualBoxa nie b\u0119dzie wog\u00f3le wykorzystywane.<\/p>\n<p>b) Vagrant (nak\u0142adka na wirtualizator)<\/p>\n<p><a href=\"https:\/\/www.vagrantup.com\/downloads.html\">https:\/\/www.vagrantup.com\/downloads.html<\/a><\/p>\n<p>Instalujemy obraz<\/p>\n<pre lang=\"text\">vagrant box add http:\/\/semantica.cs.lth.se\/pyspark\/vm.json\r\n<\/pre>\n<p>To chwil\u0119 trwa, cierpliwo\u015bci, obraz\u00a0 ponad 1 GB obj\u0119to\u015bci.<\/p>\n<p>Sprawdzamy zainstalowane obrazy maszyn wirtualnych.<\/p>\n<pre lang=\"text\">vagrant box list\r\nlth-pysparkvm (virtualbox, 1.0.1)\r\n<\/pre>\n<p>Sprawdzamy status maszyn<\/p>\n<pre lang=\"text\">vagrant status\r\nCurrent machine states:\r\nlth-pysparkvm             poweroff (virtualbox)\r\nThe VM is powered off. To restart the VM, simply run `vagrant up`\r\n<\/pre>\n<p>Uruchamiamy maszyn\u0119 wirtualn\u0105<\/p>\n<pre lang=\"text\">vagrant up lth-pysparkvm\r\nBringing machine 'lth-pysparkvm' up with 'virtualbox' provider...\r\n==&gt; lth-pysparkvm: Checking if box 'lth-pysparkvm' is up to date...\r\n==&gt; lth-pysparkvm: Clearing any previously set forwarded ports...\r\n==&gt; lth-pysparkvm: Clearing any previously set network interfaces...\r\n==&gt; lth-pysparkvm: Preparing network interfaces based on configuration...\r\n    lth-pysparkvm: Adapter 1: nat\r\n==&gt; lth-pysparkvm: Forwarding ports...\r\n    lth-pysparkvm: 8888 (guest) =&gt; 8081 (host) (adapter 1)\r\n    lth-pysparkvm: 8002 (guest) =&gt; 8082 (host) (adapter 1)\r\n    lth-pysparkvm: 4040 (guest) =&gt; 4040 (host) (adapter 1)\r\n    lth-pysparkvm: 4041 (guest) =&gt; 4041 (host) (adapter 1)\r\n    lth-pysparkvm: 4042 (guest) =&gt; 4042 (host) (adapter 1)\r\n    lth-pysparkvm: 4043 (guest) =&gt; 4043 (host) (adapter 1)\r\n    lth-pysparkvm: 4044 (guest) =&gt; 4044 (host) (adapter 1)\r\n    lth-pysparkvm: 4045 (guest) =&gt; 4045 (host) (adapter 1)\r\n    lth-pysparkvm: 22 (guest) =&gt; 2222 (host) (adapter 1)\r\n==&gt; lth-pysparkvm: Running 'pre-boot' VM customizations...\r\n==&gt; lth-pysparkvm: Booting VM...\r\n==&gt; lth-pysparkvm: Waiting for machine to boot. This may take a few minutes...\r\n    lth-pysparkvm: SSH address: 127.0.0.1:2222\r\n    lth-pysparkvm: SSH username: vagrant\r\n    lth-pysparkvm: SSH auth method: private key\r\n==&gt; lth-pysparkvm: Machine booted and ready!\r\n==&gt; lth-pysparkvm: Checking for guest additions in VM...\r\n    lth-pysparkvm: The guest additions on this VM do not match the installed version of\r\n    lth-pysparkvm: VirtualBox! In most cases this is fine, but in rare cases it  can\r\n    lth-pysparkvm: prevent things such as shared folders from working properly.\r\nIf you see\r\n    lth-pysparkvm: shared folder errors, please make sure the guest additions within the\r\n    lth-pysparkvm: virtual machine match the version of VirtualBox you have installed on\r\n    lth-pysparkvm: your host and reload your VM.\r\n    lth-pysparkvm:\r\n    lth-pysparkvm: Guest Additions Version: 4.3.30\r\n    lth-pysparkvm: VirtualBox Version: 5.1\r\n==&gt; lth-pysparkvm: Mounting shared folders...\r\n    lth-pysparkvm: \/home\/vagrant\/notebooks\/host =&gt; D:\/developing\/Vagrant\/bin\r\n\r\n\r\n<\/pre>\n<p>Jak wygl\u0105da plik VagrantFile, warto na niego spojrze\u0107, w szczeg\u00f3lno\u015bci na numery port\u00f3w, kt\u00f3re s\u0105 forwardowane.<\/p>\n<pre lang=\"RUBY\"># -*- mode: ruby -*-\r\n# vi: set ft=ruby :\r\n\r\nVagrant.configure(2) do |config|\r\n  config.vm.define \"lth-pysparkvm\" do |master|\r\n    master.vm.box = \"lth-pysparkvm\"\r\n\r\n    master.vm.network :forwarded_port, host: 8081, guest: 8888, auto_correct: true   # Jupyter port\r\n    master.vm.network :forwarded_port, host: 8082, guest: 8002, auto_correct: true   # Updater port\r\n    master.vm.network :forwarded_port, host: 4040, guest: 4040, auto_correct: true   # Spark UI (Driver)\r\n    master.vm.network :forwarded_port, host: 4041, guest: 4041, auto_correct: true   # Spark UI (Driver)\r\n    master.vm.network :forwarded_port, host: 4042, guest: 4042, auto_correct: true   # Spark UI (Driver)\r\n    master.vm.network :forwarded_port, host: 4043, guest: 4043, auto_correct: true   # Spark UI (Driver)\r\n    master.vm.network :forwarded_port, host: 4044, guest: 4044, auto_correct: true   # Spark UI (Driver)\r\n    master.vm.network :forwarded_port, host: 4045, guest: 4045, auto_correct: true   # Spark UI (Driver)\r\n    master.vm.usable_port_range = 4040..4090\r\n\r\n    master.vm.provider :virtualbox do |vb|\r\n      vb.name = \"lth-pysparkvm\"\r\n\r\n      # Assign 2 CPUs\r\n      vb.customize [\"modifyvm\", :id, \"--cpus\", \"2\"]\r\n\r\n      # Assign 2 GB Memory\r\n      vb.customize [\"modifyvm\", :id, \"--memory\", \"2048\"]\r\n\r\n      # Disable USB\r\n      vb.customize [\"modifyvm\", :id, \"--usb\", \"off\"]\r\n      vb.customize [\"modifyvm\", :id, \"--usbehci\", \"off\"]\r\n    end\r\n  end\r\n\r\n  config.vm.synced_folder \".\", \"\/home\/vagrant\/notebooks\/host\"\r\nend\r\n<\/pre>\n<p><strong>Uwagi, kt\u00f3re mog\u0105 zmniejszy\u0107 frustracj\u0119:<\/strong><\/p>\n<p>Je\u015bli zainstalujemy zbyt now\u0105 wersj\u0119 VirtualBoxa Vagrant jej nie wykryje, a co wi\u0119cej b\u0119dzie komunikowa\u0107 nas o braku HyperV.<\/p>\n<p>Co ciekawe na komputerach z windows 7 pro\u00a0 instalacja tego typu oprogramowania jest mo\u017cliwa, ale tej wersji nie testowa\u0142em,<\/p>\n<p>pozostajemy przy oprogramowaniu Virtualbox.<\/p>\n<p>Dla dociekliwych:<\/p>\n<p><a href=\"https:\/\/www.altkomakademia.pl\/baza-wiedzy\/qna\/discussion\/2763\/instalacja-hyper-v-manager-w-windows-7-\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/www.altkomakademia.pl\/baza-wiedzy\/qna\/discussion\/2763\/instalacja-hyper-v-manager-w-windows-7-<\/a><br \/>\nNa stacjach z Windows 7 standardowo zainstalowany jest powershell w versji 2.0. Wykonanie polecenia Vagrant up nie ko\u0144czy si\u0119. Nale\u017cy zaktualizowa\u0107 framework np. do wersji 4.0.<\/p>\n<p>Wersja PowerShella przed:<\/p>\n<pre lang=\"powershell\">PS C:\\Users\\User&gt; $PSVersionTable\r\n\r\nName                           Value                                                     \r\n----                           -----                                                     \r\nCLRVersion                     2.0.50727.8762                                            \r\nBuildVersion                   6.1.7601.17514                                            \r\nPSVersion                      2.0                                                       \r\nWSManStackVersion              2.0                                                       \r\nPSCompatibleVersions           {1.0, 2.0}                                                \r\nSerializationVersion           1.1.0.1                                                   \r\nPSRemotingProtocolVersion      2.1      \r\n<\/pre>\n<p>Po instalacji ze strony Microsoftu<\/p>\n<p>https:\/\/www.microsoft.com\/en-us\/download\/details.aspx?id=40855<\/p>\n<p>Wersja powershella po:<\/p>\n<pre lang=\"powershell\">PS C:\\Users\\User&gt; $PSVersionTable\r\nName                           Value                                                                                                  \r\n----                           -----                                                                                                  \r\nPSVersion                      4.0                                                                                                    \r\nWSManStackVersion              3.0                                                                                                    \r\nSerializationVersion           1.1.0.1                                                                                                \r\nCLRVersion                     4.0.30319.42000                                                                                        \r\nBuildVersion                   6.3.9600.18728                                                                                         \r\nPSCompatibleVersions           {1.0, 2.0, 3.0, 4.0}                                                                                   \r\nPSRemotingProtocolVersion      2.2                                                                                                    \r\n\r\nPS C:\\Users\\User&gt; \r\n\r\n<\/pre>\n<p>W wersji VirtualBox (5.1) domy\u015blnie nie ma dost\u0119pu do sieci zewn\u0119trznej od \u015brodka maszyny, przez co nie mo\u017cna zaktulizowa\u0107 jej oprogramowania. Pomaga poprawa w pliku konfiguracyjnym Vagrantfile<\/p>\n<p>Nale\u017cy doda\u0107 w pliku Vagranfile dodatkowe linie kodu<\/p>\n<pre lang=\"ruby\"># run net outside VM\r\n      vb.customize ['modifyvm', :id, '--natdnsproxy1', 'on']\r\n      vb.customize ['modifyvm', :id, '--natdnshostresolver1', 'on']\r\n      vb.customize ['modifyvm', :id, '--cableconnected1', 'on']\r\n      \r\n<\/pre>\n<h2>Pierwsze kroki<\/h2>\n<h4>1. Uruchomienie skryptu Pythona w notatniku Jupiter<\/h4>\n<p>Uruchamiany stron\u0119 <a href=\"http:\/\/localhost:8081\" target=\"_blank\" rel=\"noopener noreferrer\">http:\/\/localhost:8081<\/a><\/p>\n<p>Jako silnik wybieramy pyspark<\/p>\n<p>W pierwszej kom\u00f3rce wpisujemy<\/p>\n<pre lang=\"PYTHON\">#show SparkContext object\r\nsc\r\n<\/pre>\n<p>Wyj\u015bcie<br \/>\nOut[1]:<\/p>\n<p>W drugiej kom\u00f3rce<\/p>\n<pre lang=\"PYTHON\">#show type of SparkContext\r\ntype(sc)\r\n<\/pre>\n<p>Wyj\u015bcie:<br \/>\nOut[2]: pyspark.context.SparkContext<\/p>\n<p>Przyk\u0142adowy wygl\u0105d notatnika<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-106\" src=\"http:\/\/195.181.217.115\/wp-content\/uploads\/2017\/11\/localhost8081_pyspark.jpg\" alt=\"Pyspark\" width=\"1501\" height=\"823\" srcset=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8081_pyspark.jpg 1501w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8081_pyspark-300x164.jpg 300w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8081_pyspark-768x421.jpg 768w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8081_pyspark-1024x561.jpg 1024w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8081_pyspark-1200x658.jpg 1200w\" sizes=\"(max-width: 1501px) 100vw, 1501px\" \/><\/p>\n<h4>2. Aktualizujemy dost\u0119pne pliki z danymi<\/h4>\n<p>Uruchamiamy stron\u0119 <a href=\"http:\/\/localhost:8082\" target=\"_blank\" rel=\"noopener noreferrer\">http:\/\/localhost:8082<\/a><br \/>\nNaciskamy przycisk &#8220;Update all&#8221; i czekamy cierpliwie na aktualizacj\u0119, jeden z plik\u00f3w ma ponad 2GB danych.<br \/>\nPo aktualizacji wygl\u0105da to tak<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-97\" src=\"http:\/\/195.181.217.115\/wp-content\/uploads\/2017\/11\/localhost8082_datasets-1.jpg\" alt=\"Datasets\" width=\"1633\" height=\"877\" srcset=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8082_datasets-1.jpg 1633w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8082_datasets-1-300x161.jpg 300w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8082_datasets-1-768x412.jpg 768w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8082_datasets-1-1024x550.jpg 1024w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost8082_datasets-1-1200x644.jpg 1200w\" sizes=\"(max-width: 1633px) 100vw, 1633px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<h4>Dost\u0119p do UI Sparka jest\u00a0 pod adresem<\/h4>\n<p><a href=\"http:\/\/localhost:4040\" target=\"_blank\" rel=\"noopener noreferrer\">http:\/\/localhost:4040<\/a><\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-133\" src=\"http:\/\/195.181.217.115\/wp-content\/uploads\/2017\/11\/localhost_4040.jpg\" alt=\"localhost_4040\" width=\"1587\" height=\"239\" srcset=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost_4040.jpg 1587w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost_4040-300x45.jpg 300w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost_4040-768x116.jpg 768w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost_4040-1024x154.jpg 1024w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/localhost_4040-1200x181.jpg 1200w\" sizes=\"(max-width: 1587px) 100vw, 1587px\" \/><\/p>\n<h4>Dost\u0119p przez SSH<\/h4>\n<p>&nbsp;<\/p>\n<p>Dost\u0119p do maszyny przez SSH jest na porcie 2222 interfejsu lokalnego.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-121\" src=\"http:\/\/195.181.217.115\/wp-content\/uploads\/2017\/11\/ssh_session_pyspark.jpg\" alt=\"ssh_session_pyspark\" width=\"606\" height=\"535\" srcset=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_session_pyspark.jpg 606w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_session_pyspark-300x265.jpg 300w\" sizes=\"(max-width: 606px) 100vw, 606px\" \/><\/p>\n<p>&nbsp;<\/p>\n<p>Z pliku private_key nale\u017cy wygenerowa\u0107 klucz prywatny , kt\u00f3ry b\u0119dzie m\u00f3g\u0142 by\u0107 u\u017cyty przez PuttySSH.<\/p>\n<p>U mnie plik znajduje si\u0119 w podkatalogu, w kt\u00f3rym zainstalowano\u00a0 przez Vagranta\u00a0 wirtualn\u0105 maszyn\u0119:<\/p>\n<pre lang=\"text\">....Vagrant\\bin\\.vagrant\\machines\\lth-pysparkvm\\virtualbox\r\n<\/pre>\n<p>tak wygenerowany klucz prywatny nale\u017cy wskaza\u0107 na zakladce Connection\/SSH\/Auth.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-122\" src=\"http:\/\/195.181.217.115\/wp-content\/uploads\/2017\/11\/ssh_auth_pyspark.jpg\" alt=\"ssh_auth_pyspark\" width=\"606\" height=\"535\" srcset=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_auth_pyspark.jpg 606w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_auth_pyspark-300x265.jpg 300w\" sizes=\"(max-width: 606px) 100vw, 606px\" \/><\/p>\n<p>Po uruchomieniu po\u0142\u0105czenia i akceptacji klucza logujemy\u00a0 si\u0119 na u\u017cytkownika vagrant\u00a0 z pustym has\u0142em.<\/p>\n<p><img decoding=\"async\" loading=\"lazy\" class=\"aligncenter size-full wp-image-124\" src=\"http:\/\/195.181.217.115\/wp-content\/uploads\/2017\/11\/ssh_auth_loginVagrant.jpg\" alt=\"ssh_auth_loginvagrant\" width=\"830\" height=\"515\" srcset=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_auth_loginVagrant.jpg 830w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_auth_loginVagrant-300x186.jpg 300w, https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/ssh_auth_loginVagrant-768x477.jpg 768w\" sizes=\"(max-width: 830px) 100vw, 830px\" \/><\/p>\n<p>&nbsp;<\/p>\n<h4>Po zako\u0144czeniu pracy wystarczy wy\u0142\u0105czy\u0107 maszyn\u0119 wirtualn\u0105<\/h4>\n<pre lang=\"text\">vagrant halt\r\n=== lth-pysparkvm: Attempting graceful shutdown of VM...\r\n<\/pre>\n","protected":false},"excerpt":{"rendered":"<p>Poradnik: Jak uruchomi\u0107 na komputerze z MS Windows lokaln\u0105 instancj\u0119 Apache Sparka pracuj\u0105c\u0105 na jednym nodzie z gotow\u0105 obs\u0142ug\u0105 skrypt\u00f3w w Pythonie (pyspark) i przyk\u0142adowymi zbiorami danych. Na podstawie materia\u0142\u00f3w ze strony http:\/\/semantica.cs.lth.se\/pyspark\/#\/pyspark-vm Dla tych, kt\u00f3rzy na co dzie\u0144 korzystaj\u0105 &hellip; <a href=\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\">Continued<\/a><\/p>\n","protected":false},"author":1,"featured_media":1075,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[11,9,6,10],"tags":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v19.13 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Apache Spark - pierwsze pocz\u0105tki - W chmurze o chmurze i nie tylko<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\" \/>\n<meta property=\"og:locale\" content=\"pl_PL\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Apache Spark - pierwsze pocz\u0105tki - W chmurze o chmurze i nie tylko\" \/>\n<meta property=\"og:description\" content=\"Poradnik: Jak uruchomi\u0107 na komputerze z MS Windows lokaln\u0105 instancj\u0119 Apache Sparka pracuj\u0105c\u0105 na jednym nodzie z gotow\u0105 obs\u0142ug\u0105 skrypt\u00f3w w Pythonie (pyspark) i przyk\u0142adowymi zbiorami danych. Na podstawie materia\u0142\u00f3w ze strony http:\/\/semantica.cs.lth.se\/pyspark\/#\/pyspark-vm Dla tych, kt\u00f3rzy na co dzie\u0144 korzystaj\u0105 &hellip; Continued\" \/>\n<meta property=\"og:url\" content=\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\" \/>\n<meta property=\"og:site_name\" content=\"W chmurze o chmurze i nie tylko\" \/>\n<meta property=\"article:published_time\" content=\"2017-11-03T18:57:59+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2019-08-18T13:10:07+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/largest-open-source-apache-spark.png\" \/>\n\t<meta property=\"og:image:width\" content=\"684\" \/>\n\t<meta property=\"og:image:height\" content=\"509\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"djkormo\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Napisane przez\" \/>\n\t<meta name=\"twitter:data1\" content=\"djkormo\" \/>\n\t<meta name=\"twitter:label2\" content=\"Szacowany czas czytania\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minut\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\"},\"author\":{\"name\":\"djkormo\",\"@id\":\"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323\"},\"headline\":\"Apache Spark &#8211; pierwsze pocz\u0105tki\",\"datePublished\":\"2017-11-03T18:57:59+00:00\",\"dateModified\":\"2019-08-18T13:10:07+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\"},\"wordCount\":471,\"commentCount\":2,\"publisher\":{\"@id\":\"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323\"},\"articleSection\":[\"Apache\",\"Big Data\",\"Python\",\"Spark\"],\"inLanguage\":\"pl-PL\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\",\"url\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\",\"name\":\"Apache Spark - pierwsze pocz\u0105tki - W chmurze o chmurze i nie tylko\",\"isPartOf\":{\"@id\":\"https:\/\/wchmurze.cloud\/#website\"},\"datePublished\":\"2017-11-03T18:57:59+00:00\",\"dateModified\":\"2019-08-18T13:10:07+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#breadcrumb\"},\"inLanguage\":\"pl-PL\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Strona g\u0142\u00f3wna\",\"item\":\"https:\/\/wchmurze.cloud\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Apache Spark &#8211; pierwsze pocz\u0105tki\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/wchmurze.cloud\/#website\",\"url\":\"https:\/\/wchmurze.cloud\/\",\"name\":\"W chmurze o chmurze i nie tylko\",\"description\":\"W chmurze o chmurze i nie tylko\",\"publisher\":{\"@id\":\"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/wchmurze.cloud\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"pl-PL\"},{\"@type\":[\"Person\",\"Organization\"],\"@id\":\"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323\",\"name\":\"djkormo\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"pl-PL\",\"@id\":\"https:\/\/wchmurze.cloud\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/14a901b808871fa98086ae259c45d646?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/14a901b808871fa98086ae259c45d646?s=96&d=mm&r=g\",\"caption\":\"djkormo\"},\"logo\":{\"@id\":\"https:\/\/wchmurze.cloud\/#\/schema\/person\/image\/\"},\"url\":\"https:\/\/wchmurze.cloud\/index.php\/author\/djkormo\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Apache Spark - pierwsze pocz\u0105tki - W chmurze o chmurze i nie tylko","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/","og_locale":"pl_PL","og_type":"article","og_title":"Apache Spark - pierwsze pocz\u0105tki - W chmurze o chmurze i nie tylko","og_description":"Poradnik: Jak uruchomi\u0107 na komputerze z MS Windows lokaln\u0105 instancj\u0119 Apache Sparka pracuj\u0105c\u0105 na jednym nodzie z gotow\u0105 obs\u0142ug\u0105 skrypt\u00f3w w Pythonie (pyspark) i przyk\u0142adowymi zbiorami danych. Na podstawie materia\u0142\u00f3w ze strony http:\/\/semantica.cs.lth.se\/pyspark\/#\/pyspark-vm Dla tych, kt\u00f3rzy na co dzie\u0144 korzystaj\u0105 &hellip; Continued","og_url":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/","og_site_name":"W chmurze o chmurze i nie tylko","article_published_time":"2017-11-03T18:57:59+00:00","article_modified_time":"2019-08-18T13:10:07+00:00","og_image":[{"width":684,"height":509,"url":"https:\/\/wchmurze.cloud\/wp-content\/uploads\/2017\/11\/largest-open-source-apache-spark.png","type":"image\/png"}],"author":"djkormo","twitter_card":"summary_large_image","twitter_misc":{"Napisane przez":"djkormo","Szacowany czas czytania":"5 minut"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#article","isPartOf":{"@id":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/"},"author":{"name":"djkormo","@id":"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323"},"headline":"Apache Spark &#8211; pierwsze pocz\u0105tki","datePublished":"2017-11-03T18:57:59+00:00","dateModified":"2019-08-18T13:10:07+00:00","mainEntityOfPage":{"@id":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/"},"wordCount":471,"commentCount":2,"publisher":{"@id":"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323"},"articleSection":["Apache","Big Data","Python","Spark"],"inLanguage":"pl-PL","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/","url":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/","name":"Apache Spark - pierwsze pocz\u0105tki - W chmurze o chmurze i nie tylko","isPartOf":{"@id":"https:\/\/wchmurze.cloud\/#website"},"datePublished":"2017-11-03T18:57:59+00:00","dateModified":"2019-08-18T13:10:07+00:00","breadcrumb":{"@id":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#breadcrumb"},"inLanguage":"pl-PL","potentialAction":[{"@type":"ReadAction","target":["https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/wchmurze.cloud\/index.php\/2017\/11\/03\/apache-spark-pierwsze-poczatki\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Strona g\u0142\u00f3wna","item":"https:\/\/wchmurze.cloud\/"},{"@type":"ListItem","position":2,"name":"Apache Spark &#8211; pierwsze pocz\u0105tki"}]},{"@type":"WebSite","@id":"https:\/\/wchmurze.cloud\/#website","url":"https:\/\/wchmurze.cloud\/","name":"W chmurze o chmurze i nie tylko","description":"W chmurze o chmurze i nie tylko","publisher":{"@id":"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/wchmurze.cloud\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"pl-PL"},{"@type":["Person","Organization"],"@id":"https:\/\/wchmurze.cloud\/#\/schema\/person\/9832cc6f86f99f541d983d2b8d60f323","name":"djkormo","image":{"@type":"ImageObject","inLanguage":"pl-PL","@id":"https:\/\/wchmurze.cloud\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/14a901b808871fa98086ae259c45d646?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/14a901b808871fa98086ae259c45d646?s=96&d=mm&r=g","caption":"djkormo"},"logo":{"@id":"https:\/\/wchmurze.cloud\/#\/schema\/person\/image\/"},"url":"https:\/\/wchmurze.cloud\/index.php\/author\/djkormo\/"}]}},"_links":{"self":[{"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/posts\/87"}],"collection":[{"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/comments?post=87"}],"version-history":[{"count":40,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/posts\/87\/revisions"}],"predecessor-version":[{"id":1076,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/posts\/87\/revisions\/1076"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/media\/1075"}],"wp:attachment":[{"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/media?parent=87"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/categories?post=87"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wchmurze.cloud\/index.php\/wp-json\/wp\/v2\/tags?post=87"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}