Package Details: apache-spark 3.5.0-1

Git Clone URL: https://aur.archlinux.org/apache-spark.git (read-only)
Package Base: apache-spark
Description: A unified analytics engine for large-scale data processing
Upstream URL: http://spark.apache.org
Keywords: spark
Licenses: Apache
Submitter: huitseeker
Maintainer: ttc0419
Last Packager: ttc0419
Votes: 57
Popularity: 0.043039
First Submitted: 2015-10-04 09:31 (UTC)
Last Updated: 2023-09-29 13:49 (UTC)

Dependencies (2)

Required by (2)

Sources (4)

Latest Comments


krutoileshii commented on 2024-03-01 01:27 (UTC) (edited on 2024-03-01 01:29 (UTC) by krutoileshii)

Diff that updates the package to the latest release (3.5.1) and keeps the Python and R files instead of deleting them:

--- PKGBUILD.orig   2023-09-29 07:49:21.000000000 -0600
+++ PKGBUILD    2024-02-29 18:22:53.703306712 -0700
@@ -5,7 +5,7 @@
 # Contributor: Emanuel Fontelles ("emanuelfontelles") <emanuelfontelles@hotmail.com>

 pkgname=apache-spark
-pkgver=3.5.0
+pkgver=3.5.1
 pkgrel=1
 pkgdesc="A unified analytics engine for large-scale data processing"
 arch=('any')
@@ -17,7 +17,7 @@
         'apache-spark.sh'
         'apache-spark-master.service'
         'apache-spark-worker@.service')
-sha256sums=('ac064ec620129b4b9630e180e25ac2b71a8ccd4c6912bf5b5631943b742f0379'
+sha256sums=('5df15f8027067c060fe47ebd351a1431a61dbecc9c28b8dd29e2c6e1935c23eb'
             '0cc82baad4d878d4e2bc5864a00b99d38f2906781ea47ee6282546788e797049'
             'de54c025ca8ce34a7b4fd95ec7b8d5dec44582787a0bd8da09232f26e2182c9a'
             '47e6c154daecf7631ac9a33fe53a76888070c823c4381fcbde8d98377e586505')
@@ -28,7 +28,7 @@
     cd spark-${pkgver}-bin-hadoop3

     # Remove Python and R files
-    rm -rf python R
+    #rm -rf python R

     # Remove windows batch files
     rm bin/*.cmd
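
Before building with a changed sha256sums entry, it can be worth checking the downloaded tarball against the new digest by hand. The following is only a minimal sketch: verify_sha256 is a hypothetical helper, not part of the PKGBUILD, and it assumes sha256sum from coreutils is available.

```shell
# Hypothetical helper (not part of the PKGBUILD): succeeds only if the
# SHA-256 digest of FILE equals EXPECTED.
verify_sha256() {
    file=$1
    expected=$2
    # sha256sum prints "<digest>  <filename>"; keep only the digest.
    actual=$(sha256sum "$file" | cut -d' ' -f1)
    [ "$actual" = "$expected" ]
}

# Example call (commented out). "$tarball" stands for wherever the Spark
# source archive was downloaded; the digest is the new one from the diff above.
# verify_sha256 "$tarball" 5df15f8027067c060fe47ebd351a1431a61dbecc9c28b8dd29e2c6e1935c23eb
```

The function returns a non-zero status on a mismatch, so it can be chained with && or used in an if statement.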

ViToni commented on 2024-02-10 19:59 (UTC)

PySpark is also broken for me. It seems this package is intended only for cluster installations, which is inconvenient when one also wants to work locally.

lllf commented on 2023-10-09 12:39 (UTC) (edited on 2023-10-09 12:42 (UTC) by lllf)

Why are the Python and R files removed? This breaks the pyspark command.

prepare() {
    ...
    # Remove Python and R files
    rm -rf python R
}

$ pyspark
Python 3.11.3 (main, Jun  5 2023, 09:32:32) [GCC 13.1.1 20230429] on linux
Type "help", "copyright", "credits" or "license" for more information.
Could not open PYTHONSTARTUP
FileNotFoundError: [Errno 2] No such file or directory: '/opt/apache-spark/python/pyspark/shell.py'
>>> 
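
A quick way to tell whether an installed build is affected is to look for the file named in the error above. This is a rough sketch only: has_pyspark_shell is a hypothetical helper, and the default directory /opt/apache-spark is taken from the traceback rather than from any packaging guarantee.

```shell
# Hypothetical helper: succeeds if the startup file that pyspark tries to
# load exists under the given Spark directory (default taken from the
# traceback above).
has_pyspark_shell() {
    spark_home=${1:-/opt/apache-spark}
    [ -f "$spark_home/python/pyspark/shell.py" ]
}

if has_pyspark_shell; then
    echo "pyspark shell files present"
else
    echo "pyspark shell files missing"
fi
```

If the files are missing, the package was most likely built with the "rm -rf python R" line still active in prepare().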

ttc0419 commented on 2023-06-24 12:38 (UTC)

@agaskell Updated

agaskell commented on 2023-06-23 18:43 (UTC)

@ttc0419 @PolarianDev The latest version of Spark is 3.4.1.

PolarianDev commented on 2023-03-26 12:03 (UTC)

@ttc0419 Thank you. I might update it later today, but I am also busy at the moment :)

ttc0419 commented on 2023-03-25 08:10 (UTC)

@PolarianDev I added you as co-maintainer. Feel free to update it first.

PolarianDev commented on 2023-03-23 14:31 (UTC)

@ttc0419 I see in the archives that you were the one who requested that this package be orphaned. I assume you want the package?

If you do want the package, can I be added as co-maintainer?

If you don't want the package, can I take over as maintainer?

PolarianDev commented on 2023-03-23 14:18 (UTC)

Someone filed an orphan request for this package. Do they mind if I take it?

dmfay commented on 2020-04-22 20:13 (UTC)

For 2.4.5 (also fixes the worker unit description):

diff --git a/PKGBUILD b/PKGBUILD
index 54ec365..14d1180 100644
--- a/PKGBUILD
+++ b/PKGBUILD
@@ -3,7 +3,7 @@
 # Contributor: Emanuel Fontelles ("emanuelfontelles") <emanuelfontelles@hotmail.com>

 pkgname=apache-spark
-pkgver=2.4.4
+pkgver=2.4.5
 pkgrel=1
 pkgdesc="fast and general engine for large-scale data processing"
 arch=('any')
@@ -26,7 +26,7 @@ source=("https://archive.apache.org/dist/spark/spark-${pkgver}/spark-${pkgver}-b
         'spark-daemon-run.sh'
         'run-master.sh'
         'run-slave.sh')
-sha1sums=('53f99ba8c5a68c941dd17d45393a6040dd0b46c8'
+sha1sums=('338756ea89c2d15985ee24b46cec21bf9c7f2622'
           'ac71d12070a9a10323e8ec5aed4346b1dd7f21c6'
           'a191e4f8f7f8bbc596f4fadfb3c592c3efbc4fc0'
           '3fa39d55075d4728bd447692d648053c9f6b07ec'
diff --git a/apache-spark-slave@.service b/apache-spark-slave@.service
index 453b346..a90e866 100644
--- a/apache-spark-slave@.service
+++ b/apache-spark-slave@.service
@@ -1,5 +1,5 @@
 [Unit]
-Description=Apache Spark Standalone Master
+Description=Apache Spark Worker
 After=network.target

 [Service]