Results 1 to 9 of 9

Thread: Beagle not finding pdfs, odt, etc... in 11.2

  1. #1

    Default Beagle not finding pdfs, odt, etc... in 11.2

    Hello,

    I had beagle up and running in under Suse 11.0 with no issues, basically it would do a great job of searching within openoffice files, filenames, within pdfs, within thunderbird emails, etc....

    I have added the appropriate file locations, I have everything selected to index, AND the service is running.

    Beagle isnt getting into the filenames, within pdfs, or openoffice items at all.

    Any Suggestions?
    Thank you for your time in this matter.

    qu1nn
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  2. #2

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    anyone?

    Thanks

    qu1nn
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  3. #3

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    I don't have this issue in 11.2, x86_64.

    However, you could try to run "beagle-extract-content" on one of the pdf files and see if it can find a filter for it.

    Second, you could check in /etc/beagle/crawl-rules if pdf is enabled (system wide setup).

    Third, you could check in $HOME/.beagle/config for suspicious excludes.

    Question: do you use the Thunderbird backend of beagle ? This is what doesn't return any result in my case ...

  4. #4

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    sorry for not responding sooner. Hopefully today I will get back into the beagle issue that I am having:

    my searching in thunderbird works fine, however
    how do I tell if I am ACTUALLY using the thunderbird backend of beagle (i didnt see anything in the EDIT > PREFERENCES

    YAST> Software does have the following installed:
    Mozilla Thunderbird (3.0.0-1.1.1)
    beagle-thunderbird (0.3.9-9.10.4)

    regards
    qu1nn
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  5. #5

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    My intent is to search /archive and /media/ebooks
    specifically file names, within (pdf, odt, xls, doc, txt, etc)

    In the beagle search gui (0.3.9), SEARCH > PREFERENCES:

    SEARCHING TAB: all items selected:
    Start search & Indexing services automatically
    Index while on battery power
    index more aggressively while screensaver is activated

    IDEXING TAB: Selected to index my home directory and additional:
    /home/qu1nn
    /archive
    /media/ebooks

    Privacy :
    Path: $HOME/tmp
    Path: $BEAGLE_HOME/tmp
    Pattern(s): *~, #*#, *.o, *.a, *.S, *.la, *.lo, *.loT, *.so, *.exe, *.dll, *.mdb, *.com, *.csproj, *.dsp, *.dsw, *.m4, *.pc, *.pc.in, *.in.in, *.omf, *.aux, *.tmp, autom4te.cache, po, aclocal, Makefile, Makefile.am, Makefile.in, CVS, SCCS, conftest, conftest.*, /conf[0-9]+.sh/, /conf[0-9]+.file/

    MAIL FOLDER (s): spam, outbox, trash, drafts, templates

    DATA SOURCES TAB: Selected:
    Thunderbird
    Files
    Kmail
    Applications
    documentation
    manpages
    locate
    monodoc
    windows

    NETWORK OPTIONS TAB: nothing is selected


    From what I see, the settings in the gui dont seem to match those in /etc/beagle/crawl-rules, as shown below:


    /etc/beagle/crawl-rules/crawl-applications
    CRAWL_ENABLED="yes"
    CRAWL_INDEX_NAME="applications"
    CRAWL_PATHS="/usr/share/applications /usr/local/share/applications /opt/gnome/share/applications /opt/kde3/share/applications"
    CRAWL_RECURSIVE="yes"
    CRAWL_DISABLE_FILTERING="no"
    CRAWL_DISABLE_DIRECTORIES="yes"
    CRAWL_DISABLE_ON_BATTERY="yes"
    CRAWL_CACHE_TEXT="no"
    CRAWL_REMOVE_DELETED_FILES="yes"

    /etc/beagle/crawl-rules/crawl-documentation
    CRAWL_ENABLED="yes"
    CRAWL_INDEX_NAME="documentation"
    CRAWL_PATHS="/usr/share/doc /usr/local/share/doc /opt/kde3/share/doc /opt/kde3/share/doc-bundle /opt/gnome/share/gnome/help /usr/share/gnome/help /usr/share/gnome/help-bundle /opt/gnome/share/gtk-doc/html /usr/share/gtk-doc/html /usr/share/gnome/html"
    CRAWL_ALLOW_PATTERNS="*.xml,*.html,*.docbook,*.pdf"
    CRAWL_RECURSIVE="yes"
    CRAWL_DISABLE_FILTERING="no"
    CRAWL_DISABLE_DIRECTORIES="yes"
    CRAWL_DISABLE_ON_BATTERY="yes"
    CRAWL_CACHE_TEXT="no"
    CRAWL_REMOVE_DELETED_FILES="yes"


    /etc/beagle/crawl-rules/crawl-executables
    CRAWL_ENABLED="no"
    # Use the same index as the applications index
    CRAWL_INDEX_NAME="applications"
    CRAWL_PATHS="/bin /sbin /usr/bin /usr/sbin /usr/local/bin/ /usr/local/sbin"
    # Only index files in the gives directories
    CRAWL_RECURSIVE="no"
    # Don't index content
    CRAWL_DISABLE_FILTERING="yes"
    CRAWL_DISABLE_DIRECTORIES="yes"
    CRAWL_DISABLE_ON_BATTERY="yes"
    CRAWL_CACHE_TEXT="no"
    CRAWL_REMOVE_DELETED_FILES="yes"



    /etc/beagle/crawl-rules/crawl-manpages
    CRAWL_ENABLED="yes"
    CRAWL_INDEX_NAME="manpages"
    CRAWL_PATHS="/usr/share/man/man1 /usr/share/man/man2 /usr/share/man/man3 /usr/share/man/man4 /usr/share/man/man5 /usr/share/man/man6 /usr/share/man/man7 /usr/share/man/man8 /usr/share/man/man9 /usr/share/man/mann /usr/share/info"
    CRAWL_RECURSIVE="yes"
    CRAWL_DISABLE_FILTERING="no"
    CRAWL_DISABLE_DIRECTORIES="yes"
    CRAWL_DISABLE_ON_BATTERY="yes"
    CRAWL_CACHE_TEXT="no"
    CRAWL_REMOVE_DELETED_FILES="yes"


    /etc/beagle/crawl-rules/crawl-manpages
    CRAWL_ENABLED="no"
    CRAWL_INDEX_NAME="monodoc"
    CRAWL_PATHS="/usr/lib/monodoc/sources /usr/local/lib/monodoc/sources"
    # Indexing Mono.zip will take a long time. It might be a good idea to add Mono.zip to deny patterns.
    CRAWL_DENY_PATTERNS="*.source,*.tree"
    CRAWL_RECURSIVE="yes"
    CRAWL_DISABLE_FILTERING="no"
    CRAWL_DISABLE_DIRECTORIES="yes"
    CRAWL_DISABLE_ON_BATTERY="yes"
    CRAWL_CACHE_TEXT="no"
    CRAWL_REMOVE_DELETED_FILES="yes"






    suggestions?
    regards and thanks for your time in this
    qu1nn
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  6. #6

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    still at a loss: Here is a copy of ~//.beagle/Log ... There are several files, not sure which to pick:


    CURRENT-BEAGLE:
    20091226 16:12:06.8195 03836 Beagle INFO: Starting Beagle Daemon (version 0.3.9)
    20091226 16:12:06.8782 03836 Beagle INFO: Running on Mono 2.4.2.3
    20091226 16:12:06.9476 03836 Beagle INFO: Command Line: /usr/lib64/beagle/BeagleDaemon.exe --replace --bg
    20091226 16:12:08.7228 03836 Beagle WARN: Found a dangling index lock on /home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-0b4be62ddabcdfcc83b8691eda76a2b7-write.lock.
    20091226 16:12:08.7237 03836 Beagle WARN: Found a dangling index lock on /home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-a92a5ffb4516b03be63a247727e81146-write.lock.
    20091226 16:12:08.7828 03836 Beagle WARN: Indexes verified in .06s. Deleting stale lock files.
    20091226 16:12:08.7847 03836 Beagle WARN: Could not delete lock files.
    20091226 16:12:08.7947 03836 Beagle ERROR EX: Caught exception while instantiating Files backend
    20091226 16:12:08.7947 03836 Beagle ERROR EX: System.Reflection.TargetInvocationException: Exception has been thrown by the target of an invocation. ---> System.UnauthorizedAccessException: Access to the path "/home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-0b4be62ddabcdfcc83b8691eda76a2b7-write.lock" is denied.
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.File.Delete (System.String path) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.IO.Directory.Delete (System.String path, Boolean recursive) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneCommon.Create () [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryingDriver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.LuceneNameResolver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable.BuildLuceneQueryingDriver (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable..ctor () [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at (wrapper managed-to-native) System.Reflection.MonoCMethod:InternalInvoke (object,object[],System.Exception&)
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: --- End of inner exception stack trace ---
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Reflection.ConstructorInfo.Invoke (System.Object[] parameters) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type, Boolean nonPublic) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type) [0x00000]
    20091226 16:12:08.7947 03836 Beagle ERROR EX: at Beagle.Daemon.QueryDriver.ScanAssemblyForQueryables (System.Reflection.Assembly assembly) [0x00000]
    20091229 00:40:12.7061 03836 Beagle INFO: Shutdown requested
    20091229 00:40:12.7426 03836 Beagle INFO: Beagle daemon process shut down cleanly.




    current-BEAGLE-EXTENSIONS:
    20091229 07:59:15.4060 03739 Beagle ERROR EX: Caught exception while instantiating Files backend
    20091229 07:59:15.4060 03739 Beagle ERROR EX: System.Reflection.TargetInvocationException: Exception has been thrown by the target of an invocation. ---> System.UnauthorizedAccessException: Access to the path "/home/qu1nn/.beagle/Indexes/FileSystemIndex/Locks/lucene-0b4be62ddabcdfcc83b8691eda76a2b7-write.lock" is denied.
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.File.Delete (System.String path) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.Directory.RecursiveDelete (System.String path) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.IO.Directory.Delete (System.String path, Boolean recursive) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneCommon.Create () [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryingDriver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.LuceneNameResolver..ctor (System.String index_name, Int32 minor_version, Boolean read_only) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable.BuildLuceneQueryingDriver (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version, Boolean read_only_mode) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.LuceneQueryable..ctor (System.String index_name, Int32 minor_version) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.FileSystemQueryable.FileSystemQueryable..ctor () [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at (wrapper managed-to-native) System.Reflection.MonoCMethod:InternalInvoke (object,object[],System.Exception&)
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: --- End of inner exception stack trace ---
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (System.Object obj, BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.MonoCMethod.Invoke (BindingFlags invokeAttr, System.Reflection.Binder binder, System.Object[] parameters, System.Globalization.CultureInfo culture) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Reflection.ConstructorInfo.Invoke (System.Object[] parameters) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type, Boolean nonPublic) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at System.Activator.CreateInstance (System.Type type) [0x00000]
    20091229 07:59:15.4060 03739 Beagle ERROR EX: at Beagle.Daemon.QueryDriver.ScanAssemblyForQueryables (System.Reflection.Assembly assembly) [0x00000]




    I am at a loss.
    regards
    qu1nn
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  7. #7

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    Items installed:
    Need extended attributes
    multipe drives
    /dev/sda2 [ext4] / fstab options: extended user attributes selected
    data journaling mode = ordered
    Access Control Lists selected

    /dev/sda3 [ext4] /home (fstab options are the same as above)
    /dev/sdb1 [ext4] /archive (fstab options are the same as above)
    /dev/sdc1 [fat] /media/disk


    Installed Software [version]:
    openSUSE 11.2 (x86_64)
    KDE 4.3.1 release 6

    beagle-gui [0.3.9-9.10.4]
    beagle-firefox [0.3.9-9.10.4]
    beagle-index [11.2_20090721-3.12]
    beagle-thunderbird [0.3.9-9.10.4]
    beagle-xesam [0.2-145.5]

    perl-Linux-Inotify2 [1.1-131.1]
    perl-Linux-Inotify2-debuginfo [1.1-131.1]
    perl-Linux-Inotify2-debufsource [1.1-131.1]
    python-pyinotify [0.8.1-0.pm.1.1]

    libbeagle-devel [0.3.9-3.3]
    libbeagle-doc [0.3.9-3.3]
    libbeagle-python [0.3.9-3.3]
    libbeagle-python-debuginfo [0.3.9-3.3]
    libeagle1 [0.3.9-3.3]


    Mono-core [2.4.2.3-2.3]
    Mono-data [2.4.2.4-2.3]
    Mono-data-sqlite [2.4.2.4-2.3]
    mono-devel [2.4.2.4-2.3]
    mono-web [2.4.2.4-2.3]

    glib-sharp2 [2.12.9-23.6]
    NDesk-Dbus [0.6.0-65.3]
    gmime [2.2.23-4.4]
    sqlite3 [3.6.16-2.3]
    taglib-sharp [2.0.3.3-1.1.2]
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  8. #8

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    anyone have any pointers?


    qu1nn
    Pentium D 820, 2.80GHz, Intel D945PSN ATX, PowerColor Radeon HD7870, Geil DDR2, (1GB x 2) Dual Channel 667MHz Suse 13.2, KDE
    Haswell i4771, 3.5GHz, Asus Z87-WS ATX, Club3d R9 280X RoyalQueen, Corsair 8GBx2, Corsair AX860 PS, openSuse 13.1, KDE

  9. #9

    Default Re: Beagle not finding pdfs, odt, etc... in 11.2

    Please try the following:

    1. Search path [as root]

    Delete all not valid path entries in /etc/beagle/crawl-rules/crawl-documentation and add the paths to be crawled in addition. Note: Paths you already added in the GUI Kerry will not be written in the /etc/beagle/crawl-rules/crawl-documentation file. I think it's a problem of the access rights.

    2. Old indexes [as user user]

    Delete all in the folder /home/user/.beagle/Indexes

    3. New indexes [as user user]

    Log off and on your session. Beagle will start to build new indexes including your desired files.

    Good luck!

    albschrat

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts
  •