Package Details: apache-spark 3.5.0-1

Git Clone URL: (read-only)
Package Base: apache-spark
Description: A unified analytics engine for large-scale data processing
Upstream URL:
Keywords: spark
Licenses: Apache
Submitter: huitseeker
Maintainer: ttc0419
Last Packager: ttc0419
Votes: 57
Popularity: 0.071614
First Submitted: 2015-10-04 09:31 (UTC)
Last Updated: 2023-09-29 13:49 (UTC)

Dependencies (2)

Required by (2)

Sources (4)

Latest Comments


ViToni commented on 2024-02-10 19:59 (UTC)

Pyspark is also broken for me. It seems this package is meant only for cluster installations, which is inconvenient when one also wants to work locally...

lllf commented on 2023-10-09 12:39 (UTC) (edited on 2023-10-09 12:42 (UTC) by lllf)

Why are the Python and R files removed? This breaks the pyspark command.

prepare() {
    # Remove Python and R files
    rm -rf python R

$ pyspark
Python 3.11.3 (main, Jun  5 2023, 09:32:32) [GCC 13.1.1 20230429] on linux
Type "help", "copyright", "credits" or "license" for more information.
Could not open PYTHONSTARTUP
FileNotFoundError: [Errno 2] No such file or directory: '/opt/apache-spark/python/pyspark/'
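
A quick way to confirm that the failure above comes from the removed directory is to check for it directly. The following is a minimal sketch, not part of the package: `check_pyspark_dir` is a hypothetical helper, and the default of `/opt/apache-spark` is an assumption taken from the path in the error message.

```shell
# Sketch: report whether a Spark installation still ships its Python sources.
# check_pyspark_dir is a hypothetical helper, not something the package provides.
check_pyspark_dir() {
    # First argument wins, then $SPARK_HOME, then the assumed default
    # (/opt/apache-spark, taken from the error message).
    spark_home="${1:-${SPARK_HOME:-/opt/apache-spark}}"
    if [ -d "${spark_home}/python/pyspark" ]; then
        echo "present: ${spark_home}/python/pyspark"
        return 0
    fi
    echo "missing: ${spark_home}/python/pyspark"
    return 1
}
```

Calling `check_pyspark_dir /opt/apache-spark` should print a `missing:` line on an installation where `prepare()` has removed the `python` directory.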

ttc0419 commented on 2023-06-24 12:38 (UTC)

@agaskell Updated

agaskell commented on 2023-06-23 18:43 (UTC)

@ttc0419 @PolarianDev the latest version of Spark is 3.4.1.

PolarianDev commented on 2023-03-26 12:03 (UTC)

@ttc0419 thank you, I might update it later today but I am also busy at the moment :)

ttc0419 commented on 2023-03-25 08:10 (UTC)

@PolarianDev Added you as co-maintainer, feel free to update it first

PolarianDev commented on 2023-03-23 14:31 (UTC)

@ttc0419 I see in the archives that you were the one who requested this package be orphaned, so I assume you want it?

If you do want the package, could I be added as co-maintainer?

If you don't want the package, can I claim it as maintainer?

PolarianDev commented on 2023-03-23 14:18 (UTC)

Someone filed an orphan request for it; do they mind if I take the package?

dmfay commented on 2020-04-22 20:13 (UTC)

For 2.4.5 (also fixes the worker unit description):

diff --git a/PKGBUILD b/PKGBUILD
index 54ec365..14d1180 100644
@@ -3,7 +3,7 @@
 # Contributor: Emanuel Fontelles ("emanuelfontelles") <>

 pkgdesc="fast and general engine for large-scale data processing"
@@ -26,7 +26,7 @@ source=("${pkgver}/spark-${pkgver}-b
diff --git a/apache-spark-slave@.service b/apache-spark-slave@.service
index 453b346..a90e866 100644
--- a/apache-spark-slave@.service
+++ b/apache-spark-slave@.service
@@ -1,5 +1,5 @@
-Description=Apache Spark Standalone Master
+Description=Apache Spark Worker
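
The unit-file part of the diff above amounts to a one-line substitution. As a rough sketch only, it could be applied to a local copy of the service file like this; `fix_worker_description` is a hypothetical helper name, and the two description strings are copied from the diff.

```shell
# Sketch: apply the Description fix from the diff to a unit file.
# fix_worker_description is a hypothetical helper, not part of the package.
fix_worker_description() {
    unit_file="$1"
    # Rewrite only the Description line that still carries the master's text,
    # writing to a temporary file first so the original is replaced atomically.
    sed 's/^Description=Apache Spark Standalone Master$/Description=Apache Spark Worker/' \
        "$unit_file" > "${unit_file}.tmp" && mv "${unit_file}.tmp" "$unit_file"
}
```

For example, `fix_worker_description apache-spark-slave@.service` run inside a checkout of the package sources would update the worker unit in place.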


ryukinix commented on 2020-01-17 23:12 (UTC) (edited on 2020-01-17 23:13 (UTC) by ryukinix)

Updating to Spark 3.0.0-preview2 makes it work with Python 3.8. I'm using this modified version of the PKGBUILD: So far it's working fine.

The v2.4.5 GitHub tag was released 5 days ago, but the compiled version is not yet available in the Apache Spark archive. For that reason I used 3.0.0-preview2, which is the most recent version available in the archive.

Versions v2.4.5 and v3.0.0 contain the commit that fixes the problems with Python 3.8: