Package: stringi 1.8.4.9001
stringi: Fast and Portable Character String Processing Facilities
A collection of character string/text/natural language processing tools for pattern searching (e.g., with 'Java'-like regular expressions or the 'Unicode' collation algorithm), random string generation, case mapping, string transliteration, concatenation, sorting, padding, wrapping, Unicode normalisation, date-time formatting and parsing, and many more. They are fast, consistent, convenient, and - thanks to 'ICU' (International Components for Unicode) - portable across all locales and platforms. Documentation about 'stringi' is provided via its website at <https://stringi.gagolewski.com/> and the paper by Gagolewski (2022, <doi:10.18637/jss.v103.i02>).
Authors:
stringi_1.8.4.9001.tar.gz
stringi_1.8.4.9001.zip(r-4.5)stringi_1.8.4.9001.zip(r-4.4)stringi_1.8.4.9001.zip(r-4.3)
stringi_1.8.4.9001.tgz(r-4.4-x86_64)stringi_1.8.4.9001.tgz(r-4.4-arm64)stringi_1.8.4.9001.tgz(r-4.3-x86_64)stringi_1.8.4.9001.tgz(r-4.3-arm64)
stringi_1.8.4.9001.tar.gz(r-4.5-noble)stringi_1.8.4.9001.tar.gz(r-4.4-noble)
stringi_1.8.4.9001.tgz(r-4.4-emscripten)stringi_1.8.4.9001.tgz(r-4.3-emscripten)
stringi.pdf |stringi.html✨
stringi/json (API)
NEWS
# Install 'stringi' in R: |
install.packages('stringi', repos = c('https://gagolews.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/gagolews/stringi/issues
icuicu4cnatural-language-processingnlpregexregexpstring-manipulationstringistringrtexttext-processingtidy-dataunicode
Last updated 4 months agofrom:1304a6c35b. Checks:OK: 4 NOTE: 5. Indexed: yes.
Target | Result | Date |
---|---|---|
Doc / Vignettes | OK | Nov 08 2024 |
R-4.5-win-x86_64 | NOTE | Nov 08 2024 |
R-4.5-linux-x86_64 | NOTE | Nov 08 2024 |
R-4.4-win-x86_64 | NOTE | Nov 08 2024 |
R-4.4-mac-x86_64 | NOTE | Nov 08 2024 |
R-4.4-mac-aarch64 | NOTE | Nov 08 2024 |
R-4.3-win-x86_64 | OK | Nov 08 2024 |
R-4.3-mac-x86_64 | OK | Nov 08 2024 |
R-4.3-mac-aarch64 | OK | Nov 08 2024 |
Exports:%s!=%%s!==%%s*%%s+%%s<%%s<=%%s==%%s===%%s>%%s>=%%s$%%stri!=%%stri!==%%stri*%%stri+%%stri<%%stri<=%%stri==%%stri===%%stri>%%stri>=%%stri$%stri_cstri_c_liststri_cmpstri_cmp_eqstri_cmp_equivstri_cmp_gestri_cmp_gtstri_cmp_lestri_cmp_ltstri_cmp_neqstri_cmp_nequivstri_collstri_comparestri_convstri_countstri_count_boundariesstri_count_charclassstri_count_collstri_count_fixedstri_count_regexstri_count_wordsstri_datetime_addstri_datetime_add<-stri_datetime_createstri_datetime_fieldsstri_datetime_formatstri_datetime_fstrstri_datetime_nowstri_datetime_parsestri_datetime_symbolsstri_detectstri_detect_charclassstri_detect_collstri_detect_fixedstri_detect_regexstri_dupstri_duplicatedstri_duplicated_anystri_enc_detectstri_enc_detect2stri_enc_fromutf32stri_enc_getstri_enc_infostri_enc_isasciistri_enc_isutf16bestri_enc_isutf16lestri_enc_isutf32bestri_enc_isutf32lestri_enc_isutf8stri_enc_liststri_enc_markstri_enc_setstri_enc_toasciistri_enc_tonativestri_enc_toutf32stri_enc_toutf8stri_encodestri_endswithstri_endswith_charclassstri_endswith_collstri_endswith_fixedstri_escape_unicodestri_extractstri_extract_allstri_extract_all_boundariesstri_extract_all_charclassstri_extract_all_collstri_extract_all_fixedstri_extract_all_regexstri_extract_all_wordsstri_extract_firststri_extract_first_boundariesstri_extract_first_charclassstri_extract_first_collstri_extract_first_fixedstri_extract_first_regexstri_extract_first_wordsstri_extract_laststri_extract_last_boundariesstri_extract_last_charclassstri_extract_last_collstri_extract_last_fixedstri_extract_last_regexstri_extract_last_wordsstri_flattenstri_infostri_isemptystri_joinstri_join_liststri_lengthstri_list2matrixstri_locale_getstri_locale_infostri_locale_liststri_locale_setstri_locatestri_locate_allstri_locate_all_boundariesstri_locate_all_charclassstri_locate_all_collstri_locate_all_fixedstri_locate_all_regexstri_locate_all_wordsstri_locate_firststri_locate_first_boundariesstri_locate_first_charclassstri_locate_first_collstri_locate_first_fixedstri_locate_first_regexstri_locate_first_wordsstri_locate_laststri_locate_last_boundariesstri_locate_last_charclassstri_locate_last_collstri_locate_last_fixedstri_locate_last_regexstri_locate_last_wordsstri_matchstri_match_allstri_match_all_regexstri_match_firststri_match_first_regexstri_match_laststri_match_last_regexstri_na2emptystri_numbytesstri_omit_emptystri_omit_empty_nastri_omit_nastri_opts_brkiterstri_opts_collatorstri_opts_fixedstri_opts_regexstri_orderstri_padstri_pad_bothstri_pad_leftstri_pad_rightstri_pastestri_paste_liststri_printfstri_rand_lipsumstri_rand_shufflestri_rand_stringsstri_rankstri_read_linesstri_read_rawstri_remove_emptystri_remove_empty_nastri_remove_nastri_replacestri_replace_allstri_replace_all_charclassstri_replace_all_collstri_replace_all_fixedstri_replace_all_regexstri_replace_firststri_replace_first_charclassstri_replace_first_collstri_replace_first_fixedstri_replace_first_regexstri_replace_laststri_replace_last_charclassstri_replace_last_collstri_replace_last_fixedstri_replace_last_regexstri_replace_nastri_replace_rstrstri_reversestri_sortstri_sort_keystri_splitstri_split_boundariesstri_split_charclassstri_split_collstri_split_fixedstri_split_linesstri_split_lines1stri_split_regexstri_sprintfstri_startswithstri_startswith_charclassstri_startswith_collstri_startswith_fixedstri_stats_generalstri_stats_latexstri_string_formatstri_substri_sub_allstri_sub_all_replacestri_sub_all<-stri_sub_replacestri_sub_replace_allstri_sub<-stri_subsetstri_subset_charclassstri_subset_charclass<-stri_subset_collstri_subset_coll<-stri_subset_fixedstri_subset_fixed<-stri_subset_regexstri_subset_regex<-stri_subset<-stri_timezone_getstri_timezone_infostri_timezone_liststri_timezone_setstri_trans_casefoldstri_trans_charstri_trans_generalstri_trans_isnfcstri_trans_isnfdstri_trans_isnfkcstri_trans_isnfkc_casefoldstri_trans_isnfkdstri_trans_liststri_trans_nfcstri_trans_nfdstri_trans_nfkcstri_trans_nfkc_casefoldstri_trans_nfkdstri_trans_tolowerstri_trans_totitlestri_trans_toupperstri_trimstri_trim_bothstri_trim_leftstri_trim_rightstri_unescape_unicodestri_uniquestri_widthstri_wrapstri_write_lines
Dependencies:
Readme and manuals
Help Manual
Help page | Topics |
---|---|
Concatenate Two Character Vectors | %s+% %stri+% operator_add operator_plus oper_plus |
Compare Strings with or without Collation | %s!=% %s!==% %s<% %s<=% %s==% %s===% %s>% %s>=% %stri!=% %stri!==% %stri<% %stri<=% %stri==% %stri===% %stri>% %stri>=% operator_compare oper_compare oper_comparison |
C-Style Formatting with 'stri_sprintf' as a Binary Operator | %s$% %stri$% operator_dollar oper_dollar |
Passing Arguments to Functions in 'stringi' | about_arguments arguments stringi-arguments |
Character Encodings and 'stringi' | about_encoding encoding stringi-encoding |
Locales and 'stringi' | about_locale locale stringi-locale |
String Searching | about_search search stringi-search |
Text Boundary Analysis in 'stringi' | about_search_boundaries search_boundaries stringi-search-boundaries |
Character Classes in 'stringi' | about_search_charclass search_charclass stringi-search-charclass |
Locale-Sensitive Text Searching in 'stringi' | about_search_coll search_coll stringi-search-coll |
Locale-Insensitive Fixed Pattern Matching in 'stringi' | about_search_fixed search_fixed stringi-search-fixed |
Regular Expressions in 'stringi' | about_search_regex search_regex stringi-search-regex |
Fast and Portable Character String Processing in R | -package stringi-package about_stringi stringi |
Compare Strings with or without Collation | stri_cmp stri_cmp_eq stri_cmp_equiv stri_cmp_ge stri_cmp_gt stri_cmp_le stri_cmp_lt stri_cmp_neq stri_cmp_nequiv stri_compare |
Count the Number of Pattern Occurrences | stri_count stri_count_charclass stri_count_coll stri_count_fixed stri_count_regex |
Count the Number of Text Boundaries | stri_count_boundaries stri_count_words |
Date and Time Arithmetic | stri_datetime_add stri_datetime_add<- |
Create a Date-Time Object | stri_datetime_create |
Get Values for Date and Time Fields | stri_datetime_fields |
Date and Time Formatting and Parsing | stri_datetime_format stri_datetime_parse |
Convert 'strptime'-Style Format Strings | stri_datetime_fstr |
Get Current Date and Time | stri_datetime_now |
List Localizable Date-Time Formatting Data | stri_datetime_symbols |
Detect Pattern Occurrences | stri_detect stri_detect_charclass stri_detect_coll stri_detect_fixed stri_detect_regex |
Duplicate Strings | %s*% %stri*% operator_multiply oper_multiply stri_dup |
Determine Duplicated Elements | stri_duplicated stri_duplicated_any |
Detect Character Set and Language | stri_enc_detect |
[DEPRECATED] Detect Locale-Sensitive Character Encoding | stri_enc_detect2 |
Convert From UTF-32 | stri_enc_fromutf32 |
Query a Character Encoding | stri_enc_info |
Check If a Data Stream Is Possibly in ASCII | stri_enc_isascii |
Check If a Data Stream Is Possibly in UTF-16 or UTF-32 | stri_enc_isutf16be stri_enc_isutf16le stri_enc_isutf32be stri_enc_isutf32le |
Check If a Data Stream Is Possibly in UTF-8 | stri_enc_isutf8 |
List Known Character Encodings | stri_enc_list |
Get Declared Encodings of Each String | stri_enc_mark |
Set or Get Default Character Encoding in 'stringi' | stri_enc_get stri_enc_set |
Convert To ASCII | stri_enc_toascii |
Convert Strings To Native Encoding | stri_enc_tonative |
Convert Strings To UTF-32 | stri_enc_toutf32 |
Convert Strings To UTF-8 | stri_enc_toutf8 |
Convert Strings Between Given Encodings | stri_conv stri_encode |
Escape Unicode Code Points | stri_escape_unicode |
Extract Pattern Occurrences | stri_extract stri_extract_all stri_extract_all_charclass stri_extract_all_coll stri_extract_all_fixed stri_extract_all_regex stri_extract_first stri_extract_first_charclass stri_extract_first_coll stri_extract_first_fixed stri_extract_first_regex stri_extract_last stri_extract_last_charclass stri_extract_last_coll stri_extract_last_fixed stri_extract_last_regex |
Extract Data Between Text Boundaries | stri_extract_all_boundaries stri_extract_all_words stri_extract_first_boundaries stri_extract_first_words stri_extract_last_boundaries stri_extract_last_words |
Flatten a String | stri_flatten |
Query Default Settings for 'stringi' | stri_info |
Determine if a String is of Length Zero | stri_isempty |
Concatenate Character Vectors | stri_c stri_join stri_paste |
Concatenate Strings in a List | stri_c_list stri_join_list stri_paste_list |
Count the Number of Code Points | stri_length |
Convert a List to a Character Matrix | stri_list2matrix |
Query Given Locale | stri_locale_info |
List Available Locales | stri_locale_list |
Set or Get Default Locale in 'stringi' | stri_locale_get stri_locale_set |
Locate Pattern Occurrences | stri_locate stri_locate_all stri_locate_all_charclass stri_locate_all_coll stri_locate_all_fixed stri_locate_all_regex stri_locate_first stri_locate_first_charclass stri_locate_first_coll stri_locate_first_fixed stri_locate_first_regex stri_locate_last stri_locate_last_charclass stri_locate_last_coll stri_locate_last_fixed stri_locate_last_regex |
Locate Text Boundaries | stri_locate_all_boundaries stri_locate_all_words stri_locate_first_boundaries stri_locate_first_words stri_locate_last_boundaries stri_locate_last_words |
Extract Regex Pattern Matches, Together with Capture Groups | stri_match stri_match_all stri_match_all_regex stri_match_first stri_match_first_regex stri_match_last stri_match_last_regex |
Replace NAs with Empty Strings | stri_na2empty |
Count the Number of Bytes | stri_numbytes |
Generate a List with BreakIterator Settings | stri_opts_brkiter |
Generate a List with Collator Settings | stri_coll stri_opts_collator |
Generate a List with Fixed Pattern Search Engine's Settings | stri_opts_fixed |
Generate a List with Regex Matcher Settings | stri_opts_regex |
Ordering Permutation | stri_order |
Pad (Center/Left/Right Align) a String | stri_pad stri_pad_both stri_pad_left stri_pad_right |
A Lorem Ipsum Generator | stri_rand_lipsum |
Randomly Shuffle Code Points in Each String | stri_rand_shuffle |
Generate Random Strings | stri_rand_strings |
Ranking | stri_rank |
Read Text Lines from a Text File | stri_read_lines |
Read Text File as Raw | stri_read_raw |
Remove All Empty Strings from a Character Vector | stri_omit_empty stri_omit_empty_na stri_omit_na stri_remove_empty stri_remove_empty_na stri_remove_na |
Replace Pattern Occurrences | stri_replace stri_replace_all stri_replace_all_charclass stri_replace_all_coll stri_replace_all_fixed stri_replace_all_regex stri_replace_first stri_replace_first_charclass stri_replace_first_coll stri_replace_first_fixed stri_replace_first_regex stri_replace_last stri_replace_last_charclass stri_replace_last_coll stri_replace_last_fixed stri_replace_last_regex |
Replace Missing Values in a Character Vector | stri_replace_na |
Convert gsub-Style Replacement Strings | stri_replace_rstr |
Reverse Each String | stri_reverse |
String Sorting | stri_sort |
Sort Keys | stri_sort_key |
Split a String By Pattern Matches | stri_split stri_split_charclass stri_split_coll stri_split_fixed stri_split_regex |
Split a String at Text Boundaries | stri_split_boundaries |
Split a String Into Text Lines | stri_split_lines stri_split_lines1 |
Format Strings | stri_printf stri_sprintf stri_string_format |
Determine if the Start or End of a String Matches a Pattern | stri_endswith stri_endswith_charclass stri_endswith_coll stri_endswith_fixed stri_startswith stri_startswith_charclass stri_startswith_coll stri_startswith_fixed |
General Statistics for a Character Vector | stri_stats_general |
Statistics for a Character Vector Containing LaTeX Commands | stri_stats_latex |
Extract a Substring From or Replace a Substring In a Character Vector | stri_sub stri_sub<- stri_sub_replace |
Extract or Replace Multiple Substrings | stri_sub_all stri_sub_all<- stri_sub_all_replace stri_sub_replace_all |
Select Elements that Match a Given Pattern | stri_subset stri_subset<- stri_subset_charclass stri_subset_charclass<- stri_subset_coll stri_subset_coll<- stri_subset_fixed stri_subset_fixed<- stri_subset_regex stri_subset_regex<- |
Set or Get Default Time Zone in 'stringi' | stri_timezone_get stri_timezone_set |
Query a Given Time Zone | stri_timezone_info |
List Available Time Zone Identifiers | stri_timezone_list |
Translate Characters | stri_trans_char |
General Text Transforms, Including Transliteration | stri_trans_general |
List Available Text Transforms and Transliterators | stri_trans_list |
Perform or Check For Unicode Normalization | stri_trans_isnfc stri_trans_isnfd stri_trans_isnfkc stri_trans_isnfkc_casefold stri_trans_isnfkd stri_trans_nfc stri_trans_nfd stri_trans_nfkc stri_trans_nfkc_casefold stri_trans_nfkd |
Transform Strings with Case Mapping or Folding | stri_trans_casefold stri_trans_tolower stri_trans_totitle stri_trans_toupper |
Trim Characters from the Left and/or Right Side of a String | stri_trim stri_trim_both stri_trim_left stri_trim_right |
Un-escape All Escape Sequences | stri_unescape_unicode |
Extract Unique Elements | stri_unique |
Determine the Width of Code Points | stri_width |
Word Wrap Text to Format Paragraphs | stri_wrap |
Write Text Lines to a Text File | stri_write_lines |